Lockstep (computing): Difference between revisions

Content deleted Content added
added main article info
D6 (talk | contribs)
m fmt headline levels (to start with "==", WP Check Wikipedia check #7
Line 9:
The term "[[lockstep]]" originates in the prison usage, where it refers to the synchronized walking, in which the marchers walk as closely together as physically practical.
 
===Dual Modular Redundancy===
{{Main|Dual modular redundant}}
Where the computing systems are duplicated, but both actively process each step, it is difficult to arbitrate between them if their outputs differ at the end of a step. For this reason, it is common practice to run DMR systems as "master/slave" configurations with the slave as a "hot-standby" to the master, rather than in lockstep. Since there is no advantage in having the slave unit actively process each step, a common method of working is for the master to copy its state at the end of each step's processing to the slave. Should the master fail at some point, the slave is ready to continue from the previous known good step.
Line 15:
While either the lockstep or the DMR approach (when combined with some means of detecting errors in the master) can provide redundancy against hardware failure in the master, they do not protect against software failure. If the master fails because of a software error, it is highly likely that the slave - in attempting to repeat the execution of the step which failed - will simply repeat the same error and fail in the same way, an example of a [[common mode failure]].
 
===Triple Modular Redundancy===
{{Main|Triple modular redundancy}}
Where the computing systems are triplicated, it becomes possible to treat them as "voting" systems. If one unit's output disagrees with the other two, it is detected as having failed. The matched output from the other two is treated as correct.