Non-blocking algorithm: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 14:35, 28 October 2012 edit DavidCary (talk \| contribs) Extended confirmed users 7,223 edits mention implementation difficulties ← Previous edit		Latest revision as of 09:27, 20 August 2025 edit undo Bender the Bot (talk \| contribs) Bots 1,064,377 edits m →top: HTTP to HTTPS for Brown University Tag: AWB
(141 intermediate revisions by 80 users not shown)
Line 1: {{Short description\|Algorithm in a thread whose failure cannot cause another thread to fail}} ~~{{refimprove\|date=August 2010}}~~ {{Distinguish\|non-blocking I/O}} In [[computer science]], a '''non-blocking algorithm''' ensures that [[thread (software engineering)\|thread]]s competing for a shared [[resource (computer science)\|resource]] do not have their [[execution (computers)\|execution]] indefinitely postponed by [[mutual exclusion]]. A non-blocking algorithm is '''lock-free''' if there is guaranteed system-wide [[Resource starvation\|progress]]; '''wait-free''' if there is also guaranteed per-thread progress. In [[computer science]], an [[algorithm]] is called '''non-blocking''' if failure or [[Scheduling (computing)\|suspension]] of any [[thread (computing)\|thread]] cannot cause failure or suspension of another thread;<ref>{{cite book\|last1=Göetz\|first1=Brian\|last2=Peierls\|first2=Tim\|last3=Bloch\|first3=Joshua\|last4=Bowbeer\|first4=Joseph\|last5=Holmes\|first5=David\|last6=Lea\|first6=Doug\|title=Java concurrency in practice\|date=2006\|publisher=Addison-Wesley\|___location=Upper Saddle River, NJ\|isbn=9780321349606\|page=[https://archive.org/details/javaconcurrencyi00goet/page/41 41]\|url-access=registration\|url=https://archive.org/details/javaconcurrencyi00goet/page/41}}</ref> for some operations, these algorithms provide a useful alternative to traditional [[lock (computer science)\|blocking implementations]]. A non-blocking algorithm is '''lock-free''' if there is guaranteed system-wide [[Resource starvation\|progress]], and '''wait-free''' if there is also guaranteed per-thread progress. "Non-blocking" was used as a synonym for "lock-free" in the literature until the introduction of obstruction-freedom in 2003.<ref name=obs-free>{{cite conference\|last1=Herlihy\|first1=M.\|last2=Luchangco\|first2=V.\|last3=Moir\|first3=M.\|title=Obstruction-Free Synchronization: Double-Ended Queues as an Example\|conference=23rd [[International Conference on Distributed Computing Systems]]\|year=2003\|pages=522\|url=https://www.cs.brown.edu/people/mph/HerlihyLM03/main.pdf}}</ref> Literature up to the turn of the 21st century used "non-blocking" synonymously with lock-free. However, since 2003,<ref name=obs-free>{{cite journal\|last=Herlihy\|first=M.\|last2=Luchangco\|first2=V.\|last3=Moir\|first3=M.\|title=Obstruction-Free Synchronization: Double-Ended Queues as an Example\|journal=23rd [[International Conference on Distributed Computing Systems]]\|year=2003\|pages=522\|url=http://www.cs.brown.edu/people/mph/HerlihyLM03/main.pdf}}</ref> the term has been weakened to only prevent progress-blocking interactions with a [[Computer multitasking\|preemptive scheduler]]. In modern usage, therefore, an algorithm is ''non-blocking'' if the suspension of one or more threads will not stop the potential progress of the remaining threads. They are designed to avoid requiring a [[critical section]]. Often, these algorithms allow multiple processes to make progress on a problem without ever blocking each other. For some operations, these algorithms provide an alternative to [[lock (computer science)\|locking mechanism]]s. The word "non-blocking" was traditionally used to describe [[telecommunications network]]s that could route a connection through a set of relays "without having to re-arrange existing calls"{{Quote without source\|date=November 2024}} (see [[Clos network]]). Also, if the telephone exchange "is not defective, it can always make the connection"{{Quote without source\|date=November 2024}} (see [[nonblocking minimal spanning switch]]). == Motivation == {{~~main~~Main\|Lock (computer science)#~~The problems with locks~~Disadvantages\|l1=~~The problems~~Disadvantages ~~with~~of locks}} The traditional approach to multi-threaded programming is to use [[~~Lock~~lock (computer science)\|locks]] to synchronize access to shared [[resource (computer science)\|resources]]. Synchronization primitives such as [[mutual exclusion\|mutexes]], [[Semaphore (programming)\|semaphores]], and [[critical section]]s are all mechanisms by which a programmer can ensure that certain sections of code do not execute concurrently, if doing so would corrupt shared memory structures. If one thread attempts to acquire a lock that is already held by another thread, the thread will block until the lock is free. Blocking a thread iscan be undesirable for many reasons. An obvious reason is that while the thread is blocked, it cannot accomplish anything. : Ifif the blocked thread ishad been performing a high-priority or [[real-time computing\|real-time]] task, it iswould be highly undesirable to halt its progress. Other problems are less obvious. Certain interactions between locks can lead to error conditions such as [[deadlock]], [[livelock]], and [[priority inversion]]. Using locks also involves a trade-off between coarse-grained locking, which can significantly reduce opportunities for [[parallel computing\|parallelism]], and fine-grained locking, which requires more careful design, increases locking overhead and is more prone to bugs. Other problems are less obvious. For example, certain interactions between locks can lead to error conditions such as [[Deadlock (computer science)\|deadlock]], [[livelock]], and [[priority inversion]]. Using locks also involves a trade-off between coarse-grained locking, which can significantly reduce opportunities for [[parallel computing\|parallelism]], and fine-grained locking, which requires more careful design, increases locking overhead and is more prone to bugs. Non-blocking algorithms are also safe for use in [[interrupt handler]]s: even though the [[Pre-emptive multitasking\|preempted]] thread cannot be resumed, progress is still possible without it. In contrast, global data structures protected by mutual exclusion cannot safely be accessed in a handler, as the preempted thread may be the one holding the lock. Unlike blocking algorithms, non-blocking algorithms do not suffer from these downsides, and in addition are safe for use in [[interrupt handler]]s: even though the [[Pre-emptive multitasking\|preempted]] thread cannot be resumed, progress is still possible without it. In contrast, global data structures protected by mutual exclusion cannot safely be accessed in an interrupt handler, as the preempted thread may be the one holding the lock. While this can be rectified by masking interrupt requests during the critical section, this requires the code in the critical section to have bounded (and preferably short) running time, or excessive [[interrupt latency]] may be observed.<ref name="monit">{{cite journal \| doi = 10.1145/358818.358824 \| url = http://research.microsoft.com/lampson/23-ProcessesInMesa/Abstract.html \| title = Experience with Processes and Monitors in Mesa \| author = Butler W. Lampson \| author-link = Butler W. Lampson \|author2=David D. Redell \|author2-link=David D. Redell \| journal = Communications of the ACM \| volume = 23 \| issue = 2 \| pages = 105–117 \|date=February 1980\| citeseerx = 10.1.1.142.5765 \| s2cid = 1594544 }}</ref> ~~== Implementation ==~~ A lock-free data structure can be used to improve performance. With few exceptions, non-blocking algorithms use [[Linearizability\|atomic]] [[read-modify-write]] primitives that the hardware must provide, the most notable of which is [[Compare-and-swap\|compare and swap (CAS)]]. [[Critical section]]s are almost always implemented using standard interfaces over these primitives. Until recently, all non-blocking algorithms had to be written "natively" with the underlying primitives to achieve acceptable performance. However, the emerging field of [[software transactional memory]] promises standard abstractions for writing efficient non-blocking code. A lock-free data structure increases the amount of time spent in parallel execution rather than serial execution, improving performance on a [[multi-core processor]], because access to the shared data structure does not need to be serialized to stay coherent.<ref> <ref name=lightweight-transactions>{{cite journal\|last=Harris\|first=Tim\|last2=Fraser\|first2=Keir\|title=Language support for lightweight transactions\|journal=ACM SIGPLAN Notices\|date=26 November 2003\|volume=38\|issue=11\|pages=388\|doi=10.1145/949343.949340\|url=http://research.microsoft.com/en-us/um/people/tharris/papers/2003-oopsla.pdf}}</ref><ref name=composable-memory-transactions>{{cite book\|last=Harris\|first=Tim\|last2=Marlow\|first2=S.\|last3=Peyton-Jones\|first3=S.\|last4=Herlihy\|first4=M.\|title=Proceedings of the 2005 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '05 : Chicago, Illinois\|year=2005\|publisher=ACM Press\|___location=New York, NY\|isbn=1-59593-080-9\|pages=48–60\|url=http://doi.acm.org/10.1145/1065944.1065952\|chapter=Composable memory transactions\|date=June 15 - 17}}</ref> Guillaume Marçais, and Carl Kingsford. [https://web.archive.org/web/20140518060917/http://bioinformatics.oxfordjournals.org/content/27/6/764.abstract "A fast, lock-free approach for efficient parallel counting of occurrences of k-mers"]. Bioinformatics (2011) 27(6): 764-770. {{doi\|10.1093/bioinformatics/btr011}} [http://www.genome.umd.edu/jellyfish.html "Jellyfish mer counter"]. </ref> == Implementation == With few exceptions, non-blocking algorithms use [[Linearizability\|atomic]] [[read-modify-write]] primitives that the hardware must provide, the most notable of which is [[Compare-and-swap\|compare and swap (CAS)]]. [[Critical section]]s are almost always implemented using standard interfaces over these primitives (in the general case, critical sections will be blocking, even when implemented with these primitives). In the 1990s all non-blocking algorithms had to be written "natively" with the underlying primitives to achieve acceptable performance. However, the emerging field of [[software transactional memory]] promises standard abstractions for writing efficient non-blocking code.<ref name=lightweight-transactions>{{cite journal\|last1=Harris\|first1=Tim\|last2=Fraser\|first2=Keir\|title=Language support for lightweight transactions\|journal=ACM SIGPLAN Notices\|date=26 November 2003\|volume=38\|issue=11\|pages=388\|doi=10.1145/949343.949340\|url=http://research.microsoft.com/en-us/um/people/tharris/papers/2003-oopsla.pdf\|citeseerx=10.1.1.58.8466}}</ref><ref name=composable-memory-transactions>{{cite book\|last1=Harris\|first1=Tim\|last2=Marlow\|first2=S.\|last3=Peyton-Jones\|first3=S.\|last4=Herlihy\|first4=M.\|title=Proceedings of the 2005 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '05 : Chicago, Illinois\|publisher=ACM Press\|___location=New York, NY\|isbn=978-1-59593-080-4\|pages=48–60\|chapter=Composable memory transactions\|date=June 15–17, 2005\|doi=10.1145/1065944.1065952\|s2cid=53245159 }}</ref> Much research has also been done in providing basic [[data structure]]s such as [[stack (data structure)\|stacks]], [[Queue (data structure)\|queues]], [[Set (computer science)\|sets]], and [[hash table]]s. These allow programs to easily exchange data between threads asynchronously. Additionally, some non-blocking data structures are weak enough to be implemented without special atomic primitives. These exceptions include: * a single-reader single-writer [[Circular buffer\|ring buffer]] [[FIFO (computing and electronics)\|FIFO]], with a size which evenly divides the overflow of one of the available unsigned integer types, can unconditionally be [[Producer–consumer problem#Without semaphores or monitors\|implemented safely]] using only a [[memory barrier]] * [[Read-copy-update]] with a single writer and any number of readers. (The readers are wait-free; the writer is usually lock-free, until it needs to reclaim memory). * Read-copy-update with multiple writers and any number of readers. (The readers are wait-free; multiple writers generally serialize with a lock and are not obstruction-free). Several libraries internally use lock-free techniques,<ref> ~~Unfortunately, non-blocking algorithms -- like locking algorithms before them --~~ [https://libcds.sourceforge.net/ libcds] - C++ library of lock-free containers and safe memory reclamation schema ~~are susceptible to buggy implementations that at first appear to work,~~ </ref><ref> ~~until something happens that triggers the bug, causing data corruption.<ref>~~ [https://www.liblfds.org/ liblfds] - A library of lock-free data structures, written in C ~~Herb Sutter.~~ </ref><ref> ~~[http://www.drdobbs.com/article/print?articleId=210600279&siteSectionName=cpp "Lock-Free Code: A False Sense of Security"].~~ [http://concurrencykit.org Concurrency Kit] - A C library for non-blocking system design and implementation ~~Herb Sutter.~~ </ref> but it is difficult to write lock-free code that is correct.<ref name="A_FALSE_SENSE_OF_SECURITY">Herb Sutter. {{cite web \| url=http://www.drdobbs.com/article/print?articleId=210600279&siteSectionName=cpp \| title=Lock-Free Code: A False Sense of Security \| archive-url=https://web.archive.org/web/20150901211737/http://www.drdobbs.com/article/print?articleId=210600279&siteSectionName=cpp \| archive-date=2015-09-01 \|url-status=dead}}</ref><ref name="A_CORRECTED_QUEUE">Herb Sutter. {{cite web \| archive-url=https://web.archive.org/web/20081205072023/http://www.ddj.com/cpp/210604448 \| title=Writing Lock-Free Code: A Corrected Queue \| archive-date=2008-12-05 \| url-status=dead \| url=http://www.ddj.com/cpp/210604448 }}</ref><ref> ~~[http://www.ddj.com/cpp/184401930 "Writing Lock-Free Code: A Corrected Queue"]~~ Herb Sutter. [http://www.ddj.com/cpp/211601363 "Writing a Generalized Concurrent Queue"]. ~~Herb Sutter.~~ </ref><ref> ~~[http://www.ddj.com/cpp/211601363 "Writing a Generalized Concurrent Queue"]~~ Herb Sutter. [http://www.ddj.com/cpp/184401930 "The Trouble With Locks"]. ~~Herb Sutter.~~ </ref> ~~[http://www.ddj.com/cpp/184401930 "The Trouble With Locks"]~~ Non-blocking algorithms generally involve a series of read, read-modify-write, and write instructions in a carefully designed order. Optimizing compilers can aggressively re-arrange operations. Even when they don't, many modern CPUs often re-arrange such operations (they have a "weak [[consistency model]]"), unless a [[memory barrier]] is used to tell the CPU not to reorder. [[C++11]] programmers can use <code>std::atomic</code> in <code><atomic></code>, and [[C11 (C standard revision)\|C11]] programmers can use <code><stdatomic.h></code>, both of which supply types and functions that tell the [[compiler]] not to re-arrange such instructions, and to insert the appropriate memory barriers.<ref> Bruce Dawson. [https://randomascii.wordpress.com/2020/11/29/arm-and-lock-free-programming/ "ARM and Lock-Free Programming"]. </ref> == Wait-freedom == Wait-freedom is the strongest non-blocking guarantee of progress, combining guaranteed system-wide throughput with [[Resource starvation\|starvation]]-freedom. An algorithm is wait-free if every operation has a bound on the number of steps the algorithm will take before the operation completes.<ref name="awilliams"> Anthony Williams. [https://www.justsoftwaresolutions.co.uk//files/safety_off.pdf "Safety: off: How not to shoot yourself in the foot with C++ atomics"]. 2015. p. 20. </ref> This property is critical for real-time systems and is always nice to have as long as the performance cost is not too high. It was shown in the 1980s<ref name=imp>{{cite conference \|last=Herlihy \|first=Maurice P. \|conference=Proc. 7th Annual ACM Symp. on Principles of Distributed Computing \|isbn=0-89791-277-2 \|pages=276–290 \|doi=10.1145/62546.62593 \|title=Impossibility and universality results for wait-free synchronization \|year=1988\|doi-access=free }}</ref> that all algorithms can be implemented wait-free, and many transformations from serial code, called ''universal constructions'', have been demonstrated. However, the resulting performance does not in general match even naïve blocking designs. Several papers have since improved the performance of universal constructions, but still, their performance is far below blocking designs. Wait-freedom is the strongest non-blocking guarantee of progress, combining guaranteed system-wide throughput with [[Resource starvation\|starvation]]-freedom. An algorithm is wait-free if every operation has a bound on the number of steps the algorithm will take before the operation completes. This property is critical for real-time systems and is always nice to have as long as the performance cost is not too high. Several papers have investigated the difficulty of creating wait-free algorithms. For example, it has been shown<ref name=cond-sync>{{cite conference \|last1=Fich \|first1=Faith\|author1-link=Faith Ellen \|last2=Hendler \|first2=Danny \|last3=Shavit \|first3=Nir \|conference=Proc. 23rd Annual ACM Symp.on Principles of Distributed Computing (PODC) \|year=2004 \|isbn=1-58113-802-4 \|pages=80–87 \|doi=10.1145/1011767.1011780 \|title=On the inherent weakness of conditional synchronization primitives}}</ref> that the widely available atomic ''conditional'' primitives, [[Compare-and-swap\|CAS]] and [[Load-link/store-conditional\|LL/SC]], cannot provide starvation-free implementations of many common data structures without memory costs growing linearly in the number of threads. It was shown in the 1980s<ref name=imp>{{cite book\|last=Herlihy\|first=Maurice P.\|title=Proceedings of the Seventh Annual ACM Symposium on Principles of Distributed Computing : Toronto, Ontario, Canada\|year=1988\|publisher=Association for Computing Machinery\|___location=New York, N.Y.\|isbn=0-89791-277-2\|pages=276–290\|url=http://doi.acm.org/10.1145/62546.62593\|chapter=Impossibility and universality results for wait-free synchronization\|date=August 15-17}}</ref> that all algorithms can be implemented wait-free, and many transformations from serial code, called ''universal constructions'', have been demonstrated. However, the resulting performance does not in general match even naïve blocking designs. Several papers have since improved the performance of universal constructions, but still, their performance is far below blocking designs. However, these lower bounds do not present a real barrier in practice, as spending a cache line or exclusive reservation granule (up to 2 KB on ARM) of store per thread in the shared memory is not considered too costly for practical systems. Typically, the amount of store logically required is a word, but physically CAS operations on the same cache line will collide, and LL/SC operations in the same exclusive reservation granule will collide, so the amount of store physically required{{citation needed\|date=June 2014}} is greater.{{Clarification needed\|date=October 2024\|reason=Does this imply that there actually is a barrier in practice?}} Several papers have investigated the hardness of creating wait-free algorithms. For example, it has been shown<ref name=cond-sync>{{cite book\|last=Fich\|first=Faith\|last2=Hendler\|first2=Danny\|last3=Shavit\|first3=Nir\|title=Proceedings of the 23rd Annual ACM Symposium on Principles of Distributed Computing, PODC 2004 : St. John's, Newfoundland, Canada\|year=2004\|publisher=ACM Press\|___location=New York, NY\|isbn=1-58113-802-4\|pages=80–87\|url=http://doi.acm.org/10.1145/1011767.1011780\|chapter=On the inherent weakness of conditional synchronization primitives\|date=July 25 - 28}}</ref> that the widely available atomic ''conditional'' primitives, [[Compare-and-swap\|CAS]] and [[Load-Link/Store-Conditional\|LL/SC]], cannot provide starvation-free implementations of many common data structures without memory costs growing linearly in the number of threads. But in practice these lower bounds do not present a real barrier as spending a word per thread in the shared memory is not considered too costly for practical systems. ~~Until 2011, wait~~Wait-free algorithms were rare until 2011, both in research and in practice. However, in 2011 Kogan and [[Erez Petrank\|Petrank]]<ref name=wf-queue>{{cite ~~book~~conference \|~~last~~last1=Kogan \|~~first~~first1=Alex \|last2=Petrank \|first2=Erez \|~~title~~conference=~~Proceedings of the~~Proc. 16th ACM SIGPLAN ~~Symposium~~Symp. on Principles and Practice of Parallel Programming (PPOPP) ~~2011)~~\|year=2011~~\|publisher=ACM Press\|___location=San Antonio,~~ TX\|isbn=978-1-4503-0119-0 \|pages=~~223-234~~223–234 \|~~url~~doi=~~http://doi.acm.org/~~10.1145/1941553.1941585 \|~~chapter~~title=Wait-free queues with multiple enqueuers and dequeuers\|~~date~~url=~~February 12~~http://www.cs.technion.ac.il/~erez/Papers/wfquque-16ppopp.pdf}}</ref> presented a wait-free queue building on the [[Compare-and-swap\|CAS]] primitive, generally available on common hardware. Their construction ~~expands~~expanded the lock-free queue of Michael and Scott ,<ref name=lf-queue>{{cite ~~book~~conference \|~~last~~last1=Michael \|~~first~~first1=Maged \|last2=Scott \|first2=Michael \|~~title~~conference=~~Proceedings~~Proc. ~~of the Fifteenth~~15th Annual ACM ~~Symposium~~Symp. on Principles of Distributed Computing (PODC ~~1996~~) \|year=1996~~\|publisher=ACM Press\|___location=Philadelphia, Pennsylvania,~~ ~~USA~~\|isbn=0-89791-800-2 \|pages=~~267-275~~267–275 \|~~url~~doi=~~http://doi.acm.org/~~10.1145/248052.248106 \|~~chapter~~title=~~WaitSimple~~Simple, Fast, and Practical Non-Blocking and Blocking Concurrent Queue Algorithms\|~~date~~doi-access=~~May~~free ~~23-26~~}}</ref>, which is an efficient queue often used in practice. A follow-up paper by Kogan and Petrank<ref name=wf-fpsp>{{cite ~~book~~conference \|~~last~~last1=Kogan \|~~first~~first1=Alex \|last2=Petrank \|first2=Erez \|~~title~~conference=~~Proceedings~~Proc. of17th ~~the 17ACM~~ACM SIGPLAN ~~Symposium~~Symp. on Principles and Practice of Parallel Programming (PPOPP) ~~2012)~~\|year=2012~~\|publisher=ACM Press\|___location=New Orleans,~~ LA\|isbn=978-1-4503-1160-1 \|pages=~~141-150~~141–150 \|~~url~~doi=~~http://doi.acm.org/~~10.1145/2145816.2145835 \|~~chapter~~title=A ~~methodology~~method for creating fast wait-free data structures~~\|date=February 25-29~~}}</ref> provided a ~~methodology~~method for making wait-free algorithms fast and used this ~~methodology~~method to make the wait-free queue practically as fast as its lock-free counterpart. A subsequent paper by Timnat and Petrank<ref name=wf-simulation>{{cite conference \|last1=Timnat \|first1=Shahar \|last2=Petrank \|first2=Erez \|conference=Proc. 17th ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming (PPOPP) \|year=2014 \| isbn=978-1-4503-2656-8 \| pages = 357–368 \| doi=10.1145/2692916.2555261 \| title= A Practical Wait-Free Simulation for Lock-Free Data Structures}}</ref> provided an automatic mechanism for generating wait-free data structures from lock-free ones. Thus, wait-free implementations are now available for many data-structures. Under reasonable assumptions, Alistarh, Censor-Hillel, and Shavit showed that lock-free algorithms are practically wait-free.<ref name=lf-wf>{{cite conference \|last1=Alistarh \|first1=Dan \|last2=Censor-Hillel \|first2=Keren \|last3=Shavit \|first3=Nir \|conference=Proc. 46th Annual ACM Symposium on Theory of Computing (STOC’14) \| year=2014 \| isbn=978-1-4503-2710-7 \| pages = 714–723 \| doi=10.1145/2591796.2591836 \| title=Are Lock-Free Concurrent Algorithms Practically Wait-Free?\|arxiv=1311.3200 }}</ref> Thus, in the absence of hard deadlines, wait-free algorithms may not be worth the additional complexity that they introduce. == Lock-freedom == Lock-freedom allows individual threads to starve but guarantees system-wide throughput. An algorithm is lock-free if, when the program threads are run for a sufficiently long time, at least one of the threads makes progress (for some sensible definition of progress). All wait-free algorithms are lock-free. In particular, if one thread is suspended, then a lock-free algorithm guarantees that the remaining threads can still make progress. Hence, if two threads can contend for the same mutex lock or [[spinlock]], then the algorithm is ''not'' lock-free. (If we suspend one thread that holds the lock, then the second thread will block.) An algorithm is lock-free if infinitely often operation by some processors will succeed in a finite number of steps. For instance, if {{var\|N}} processors are trying to execute an operation, some of the {{var\|N}} processes will succeed in finishing the operation in a finite number of steps and others might fail and retry on failure. The difference between wait-free and lock-free is that wait-free operation by each process is guaranteed to succeed in a finite number of steps, regardless of the other processors. Lock-freedom allows individual threads to starve but guarantees system-wide throughput. An algorithm is lock-free if it satisfies that when the program threads are run sufficiently long at least one of the threads makes ~~progress (for some sensible definition of progress). All wait-free algorithms are lock-free.~~ In general, a lock-free algorithm can run in four phases: completing one's own operation, assisting an obstructing operation, aborting an obstructing operation, and waiting. Completing one's own operation is complicated by the possibility of concurrent assistance and abortion, but is invariably the fastest path to completion. Line 60 ⟶ 95: == Obstruction-freedom == Obstruction-freedom is the weakest natural non-blocking progress guarantee. An algorithm is obstruction-free if at any point, a single thread executed in isolation (i.e., with all obstructing threads suspended) for a bounded number of steps will complete its operation.<ref name="awilliams" /> All lock-free algorithms are obstruction-free. Obstruction-freedom is possibly the weakest natural non-blocking progress guarantee. An algorithm is obstruction-free if at any point, a single thread executed in isolation (i.e., with all obstructing threads suspended) for a bounded number of steps will complete its operation. All lock-free algorithms are obstruction-free. Obstruction-freedom demands only that any partially completed operation can be aborted and the changes made rolled back. Dropping concurrent assistance can often result in much simpler algorithms that are easier to validate. Preventing the system from continually [[livelock\|live-locking]] is the task of a contention manager. ~~Obstruction-freedom is also called [[optimistic concurrency control]].~~ Some obstruction-free algorithms use a pair of "consistency markers" in the data structure. Processes reading the data structure first read one consistency marker, then read the relevant data into an internal buffer, then read the other marker, and then compare the markers. The data is consistent if the two markers are identical. Markers may be non-identical when the read is interrupted by another process updating the data structure. In such a case, the process discards the data in the internal buffer and tries again. == See also == * [[Deadlock (computer science)\|Deadlock]] * [[Concurrent data structure]] * [[Java ConcurrentMap#Lock-free atomicity]] * [[ABA problem]] * [[~~Compare-and-swap~~Liveness]] * [[Lock (computer science)]] * [[Concurrency control]] * [[Communicating sequential processes]] * [[Deadlock]] * [[JCSP]] * [[Linearizability]] * [[Load-Link/Store-Conditional]] * [[Lock (software engineering)]] * [[Memory barrier]] * [[Mutual exclusion]] * [[Pre-emptive multitasking]] * [[Priority inversion]] * [[Read-copy-update]] * [[Resource starvation]] * [[~~Room~~Non-lock ~~synchronization~~concurrency control]] * [[~~Software~~Optimistic ~~transactional~~concurrency ~~memory~~control]] * [[Partitioned global address space]] == References == {{Reflist\|30em}} ~~<references/>~~ == External links == * [http://preshing.com/20120612/an-introduction-to-lock-free-programming/ An Introduction to Lock-Free Programming] ~~{{linkfarm\|date=April 2012}}~~ * [http://tutorials.jenkov.com/java-concurrency/non-blocking-algorithms.html Non-blocking Algorithms] Article "[http://www.research.ibm.com/people/m/michael/podc-1996.pdf Simple, Fast, and Practical Non-Blocking and Blocking Concurrent Queue Algorithms]" by [[Maged M. Michael]] and [[Michael L. Scott]] Discussion "[http://groups.google.com/groups?group=comp.programming.threads&threadm=c2s1qn%24mrj%247%40newsserv.zdv.uni-tuebingen.de Communication between Threads, without blocking]" Survey "[http://www.audiomulch.com/~rossb/code/lockfree/ Some Notes on Lock-Free and Wait-Free Algorithms]" by [[Ross Bencina]] {{Javadoc:SE\|package=java.util.concurrent.atomic\|java/util/concurrent/atomic}} – supports lock-free and thread-safe programming on single variables [http://msdn2.microsoft.com/en-us/library/system.threading.interlocked.aspx System.Threading.Interlocked] - Provides atomic operations for variables that are shared by multiple threads (.NET Framework) [http://jail-ust.sourceforge.net/index.php?section=3&page=1 The Jail-Ust Container Library] [http://www.cl.cam.ac.uk/Research/SRG/netos/lock-free/ Practical lock-free data structures] Thesis "[http://www.adm.hb.se/~hsu/phd.pdf Efficient and Practical Non-Blocking Data Structures]" (1414 KB) by [[Per Håkan Sundell]] [http://www.mrtc.mdh.se/projects/warp/index.htm WARPing - Wait-free techniques for Real-time Processing] [http://www.cse.chalmers.se/~tsigas/papers/Yi-Thesis.pdf Non-blocking Synchronization: Algorithms and Performance Evaluation.] (1926 KB) by [[Yi Zhang]] "[http://dissertations.ub.rug.nl/faculties/science/2005/h.gao/ Design and verification of lock-free parallel algorithms]" by [[Hui Gao]] "[http://citeseer.ist.psu.edu/114960.html Asynchronous Data Sharing in Multiprocessor Real-Time Systems Using Process Consensus]" by Jing Chen and [[Alan Burns (professor)\|Alan Burns]] Discussion "[http://groups.google.com/groups?group=comp.programming.threads&threadm=ec1c3924.0410171103.568fa38a%40posting.google.com lock-free versus lock-based algorithms]" [http://atomic-ptr-plus.sourceforge.net/ Atomic Ptr Plus Project] - collection of various lock-free synchronization primitives [http://webpages.charter.net/appcore/ AppCore: A Portable High-Performance Thread Synchronization Library] - An Effective Marriage between Lock-Free and Lock-Based Algorithms [http://c2.com/cgi/wiki?WaitFreeSynchronization WaitFreeSynchronization] and [http://c2.com/cgi/wiki?LockFreeSynchronization LockFreeSynchronization] at the Portland Pattern Repository [http://www.hpl.hp.com/research/linux/atomic_ops/index.php4 Multiplatform library with atomic operations] [http://www.mgix.com/snippets/?LockFree A simple C++ lock-free LIFO implementation] [http://www.1024cores.net/home/lock-free-algorithms/introduction 1024cores] - a site devoted to lock-free, wait-free, obstruction-free and just scalable non-blocking synchronization algorithms and related topics [http://libcds.sourceforge.net/ libcds] - C++ library of lock-free containers and safe memory reclamation schema [http://concurrencykit.org Concurrency Kit] - A C library for non-blocking system design and implementation [http://www.unifiedsoftwaretechnologies.com Unified Software Technologies] - An ISV providing a completely lock-free web server and proprietary lock-free libraries {{DEFAULTSORT:Non-Blocking Algorithm}} Line 122 ⟶ 123: [[Category:Concurrency control]] [[Category:Concurrency control algorithms]] ~~[[Category:Operating system technology]]~~ ~~[[ar:خوارزمية غير مسدودة]]~~ ~~[[de:Nicht-blockierende Synchronisation]]~~ ~~[[he:אלגוריתם חסר נעילות]]~~ ~~[[ja:Lock-freeとWait-freeアルゴリズム]]~~ ~~[[pl:Synchronizacja nieblokująca]]~~ ~~[[ru:Неблокирующая синхронизация]]~~