Distributed lock manager: Difference between revisions

Content deleted Content added
Rewrite clustering stuff, mention Redhat and Oracle DLMs, add headings
Add sections on OpenVMS implementation
Line 1:
A '''distributed lock manager''' (DLM) provides distributed applications with a means to synchronize their accesses to shared resources.
__NOTOC__<!--Not yet, anyhow -->
 
==Clusters==
Line 7 ⟶ 6:
 
[[Red Hat]] and [[Oracle Corporation|Oracle]] have both developed clustering software for [[Linux]]. Both projects include a DLM modelled on the venerable [[OpenVMS|VMS]] DLM. (''OCFS2'', the Oracle Cluster File System[http://oss.oracle.com/projects/ocfs2/] has been included[http://www.kernel.org/git/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=29552b1462799afbe02af035b243e97579d63350] in version 2.6.16 of the official [[Linux kernel]]. Red Hat's cluster software[http://sources.redhat.com/cluster/], including their DLM[http://sources.redhat.com/cluster/dlm/] is available as a kernel patch.)
 
==VMS Implementation==
 
VMS was the first widely-available operating system to implement a DLM. This became available in Version 4, although the user interface was the same as the single-processor lock manager that was first implemented in Version 3.
 
===Resources===
 
The DLM uses a generalised concept of a '''resource''', which is some entity to which shared access must be controlled. This may relate to a file, a record, or an area of shared memory, but can be anything that the application designer chooses. A hierarchy of resources may be defined, so that a number of levels of locking can be implemented. For instance, a hypothetical database might define a resource hierarchy as follows:
 
*Database
*Table
*Record
*Field
 
A [[process (computing)|process]] can then acquire locks on the database as a whole, and then on particular parts of the database. A lock must be obtained on a parent resource before a subordinate resource can be locked.
 
===Lock Modes===
 
A process running within a VMSCluster may obtain a lock on a resource. There are six lock modes that can be granted, and these determine the level of exclusivity of access to the resource. Once a lock has been granted, it is possible to convert the lock to a higher or lower level of lock mode. When all processes have unlocked a resource, the system's information about the resource is destroyed.
 
*Null Lock. Indicates interest in the resource, but does not prevent other processes from locking it. It has the advantage that the resource and its [[#Lock value block|lock value block]] are preserved, even when no processes are locking it.
 
*Concurrent Read. Indicates a desire to read (but not update) the resource. It allows other processes to read or update the resource, but prevents others from having exclusive access to it. This is usually employed on high-level resources, in order that higher levels of lock can be obtained on subordinate resources.
 
*Concurrent Write. Indicates a desire to read and update the resource. It also allows other processes to read or update the resource, but prevents others from having exclusive access to it. This is also usually employed on high-level resources, in order that higher levels of lock can be obtained on subordinate resources.
 
*Protected Read. This is the traditional ''share lock'', which indicates a desire to read the resource but prevents other from updating it. Others can however also read the resource.
 
*Protected Update. This is the traditional ''update lock'', which inidcates a desire to read and update the resource and prevents others from updating it. Others with Concurrent Read access can however read the resource.
 
*Exclusive. This is the traditional ''exclusive lock'' which allows read and update access to the resource, and prevents others from having any access to it.
 
The following truth table shows the compatibility of each lock mode with the others:
 
{| class="wikitable"
|-
| '''Mode''' || '''NL''' || '''CR''' || '''CW''' || '''PR''' || '''PU''' || '''EX'''
|-
| '''NL''' || Yes || Yes || Yes || Yes || Yes || Yes
|-
| '''CR''' || Yes || Yes || Yes || Yes || Yes || No
|-
| '''CW''' || Yes || Yes || Yes || No || No || No
|-
| '''PR''' || Yes || Yes || No || Yes || No || No
|-
| '''PU''' || Yes || Yes || No || No || No || No
|-
| '''EX''' || Yes || No || No || No || No || No
|}
 
===Obtaining a lock===
 
A process can obtain a lock on a resource by ''enqueueing'' a lock request. This is very similar to the [[QIO]] technique that is used to perform I/O. The enqueue lock request can either complete synchronously, in which case the process waits until the lock is granted, or asynchronously, in which case an [[Asynchronous System Trap|AST]] occurs when the lock has been obtained.
 
It is also possible to establish a ''blocking AST'', which is triggered when a process has obtained a lock that is preventing access to the resource by another process. The original process can then optionally take action to allow the other access (e.g. by demoting or releasing the lock).
 
===Lock value block===
 
A lock value block is associated with each resource. This can be read by any process that has obtained a lock on the resource (other than a null lock) and can be updated by a process that has obtained a protected update or exclusive lock on it.
 
It can be used to hold any information about the resource that the application designer chooses. A typical use is to hold a ''version number'' of the resource. Each time the associated entity (e.g. a database record) is updated, the holder of the lock increments the lock value block. When another process wishes to read the resource, it obtains the appropriate lock and compares the current lock value with the value it had last time the process locked the resource. It the value is the same, the process knows that the associated entity has not been updated since last time it read it, and therefore it is unnecessary to read it again. Hence, this technique can be used to implement various types of [[cache]] in a database or similar application.
 
===Deadlock detection===
 
When one or more processes have obtained locks on resources, it is possible to produce a situation where each is preventing another from obtaining a lock, and none of them can proceed. This is known as a ''deadly embrace'' or [[deadlock]].
 
A simple example is when Process 1 has obtained an exclusive lock on Resource A, and Process 2 has obtained an exclusive lock on Resource B. If Process 1 then tries to lock Resource B, it will have to wait for Process 2 to release it. But if Process 2 then tries to lock Resource A, both processes will wait forever for each other.
 
The OpenVMS DLM periodically checks for deadlock situations. In the example above, the second lock enqueue request of one of the processes would return with a deadlock status. It would then be up to this process to take action to resolve the deadlock &mdash; in this case by releasing the first lock it obtained.
 
==SSI Systems==
 
A DLM is also a key component of more ambitious [[Single-system image|single system image]] projects such as [[OpenSSI]].
 
==References==
 
[http://h71000.www7.hp.com/doc/82FINAL/4527/4527PRO.HTML |HP OpenVMS Systems Services Reference Manual]