Content deleted Content added
Added Riak to the list of distributed data stores |
m Dating maintenance tags: {{Fact}} |
||
(124 intermediate revisions by 96 users not shown) | |||
Line 1:
{{Short description|Computer network with mulitple nodes to store information}}
{{Essay-like|date=May 2012}}
{{Memory types}}
A '''distributed data store''' is a [[computer network]] where information is stored on more than one [[Node (networking)|node]], often in a [[Replication (computing)|replicated]] fashion.<ref>{{Citation
|author = Yaniv Pessach
|title = Distributed Storage
|edition = Distributed Storage: Concepts, Algorithms, and Implementations
|ol = 25423189M
}}</ref> It is usually specifically used to refer to either a [[distributed database]] where users store information on a ''number of nodes'', or a [[computer network]] in which users store information on a ''number of peer network nodes''.{{fact|date=August 2025}}
==Distributed databases==
[[Distributed
|
| publisher = Paper Trail
| title =
| quote = Although GFS provides Google with reliable, scalable distributed file storage, it does not provide any facility for structuring the data contained in the files beyond a hierarchical directory structure and meaningful file names. It’s well known that more expressive solutions are required for large data sets. Google’s terabytes upon terabytes of data that they retrieve from web crawlers, amongst many other sources, need organising, so that client applications can quickly perform lookups and updates at a finer granularity than the file level. [...] The very first thing you need to know about
| url = http://the-paper-trail.org/blog/?p=86
| archive-url = https://web.archive.org/web/20170716092550/http://the-paper-trail.org/blog/bigtable-googles-distributed-data-store
| archive-date = 2017-07-16
| url-status = dead
}}</ref> [[Amazon.com|Amazon]]'s [[Dynamo (storage system)|Dynamo]]<ref>{{cite web
| access-date = 2011-04-05
| author = Sarah Pidcock
| date = 2011-01-31
| page = 2/22
| publisher = WATERLOO
| title = Dynamo:
| quote = Dynamo: a highly available and scalable distributed data store
| url = http://www.cs.uwaterloo.ca/~kdaudjee/courses/cs848/slides/sarah1.pdf}}</ref>
and [[Azure Services Platform|Microsoft Azure Storage]].<ref>{{cite web|url=http://www.microsoft.com/windowsazure/features/storage/|title=Windows Azure Storage|website=[[Microsoft]] |date=2011-09-16|access-date=6 November 2011|url-status=dead|archive-url=https://web.archive.org/web/20111109002826/http://www.microsoft.com/windowsazure/features/storage/|archive-date=9 November 2011}}</ref>
As the ability of arbitrary querying is not as important as the [[availability]], designers of distributed data stores have increased the latter at an expense of consistency. But the high-speed read/write access results in reduced consistency, as it is not possible to
==Peer network node data stores==
In peer network data stores, the user can usually reciprocate and allow other users to use their computer as a storage node as well. Information may or may not be accessible to other users depending on the design of the network.
Most
Distributed data stores typically use an [[error detection and correction]] technique.
Line 32 ⟶ 43:
==Examples==
===Distributed non-relational databases===
{| class="wikitable sortable"
|-
! Product
! License
! [[High availability]]
! Notes
|-
| [[Apache Accumulo]]
| {{free|[[Apache License 2.0|AL2]]}}
|
|
|-
| [[Aerospike (database)|Aerospike]]
| {{free|[[GNU Affero General Public License|AGPL]]}}
|
|
|-
| [[Apache Cassandra]]
| {{free|[[Apache License 2.0|AL2]]}}
| {{yes}}
| formerly used by [[Facebook]]
|-
| [[Apache Ignite]]
| {{free|[[Apache License 2.0|AL2]]}}
|
|
|-
| [[Bigtable]]
| {{Proprietary}}
|
| used by [[Google]]
|-
| [[Couchbase]]
| {{free|[[Apache License 2.0|AL2]]}}
|
| used by [[LinkedIn]], [[PayPal]], and [[eBay]]
|-
| [[CrateDB]]
| {{free|[[Apache License 2.0|AL2]]}}
| {{yes}}
|
|-
| [[Apache Druid]]
| {{free|[[Apache License 2.0|AL2]]}}
|
| used by [[Netflix]], and [[Yahoo]]
|-
| [[Dynamo (storage system)|Dynamo]]
| {{Proprietary}}
|
| used by [[Amazon.com|Amazon]]
|-
| [[Container_Linux#ETCD|etcd]]
| {{free|[[Apache License 2.0|AL2]]}}
| {{yes}}
|
|-
| [[Hazelcast]]
| {{proprietary|[[Apache License 2.0|AL2]], Proprietary}}
|
|
|-
| [[HBase]]
| {{free|[[Apache License 2.0|AL2]]}}
| {{yes}}
| formerly used by Facebook
|-
| [[Hypertable]]
| {{free|[[GPL 2]]}}
|
| [[Baidu]]
|-
| [[MongoDB]]
| {{proprietary|[[Server Side Public License|SSPL]]}}
|
|
|-
| [[MySQL Cluster|MySQL NDB Cluster]]
| {{free|[[GPL 2]]}}
| {{yes}}
| SQL and NoSQL APIs
|-
| [[Riak]]
| {{free|[[Apache License 2.0|AL2]]}}
| {{yes}}
|
|-
|-
| [[Redis]]
| {{free|[[BSD License]]}}
| {{yes}}
|
|-
| [[ScyllaDB]]
| {{free|[[GNU Affero General Public License|AGPL]]}}
|
|
|-
| [[Voldemort (distributed data store)|Voldemort]]
| {{free|[[Apache License 2.0|AL2]]}}
|
| used by [[LinkedIn]]
|-
|}
===Peer network node data stores===
* [[BitTorrent (protocol)|BitTorrent]]
* [[Blockchain (database)]]
* [[Chord project]]
* [[Freenet]]
* [[GNUnet]]
* [[InterPlanetary File System|IPFS]]
* [[Mnet (Computer program)|Mnet]]
* [[Napster]]
* [[Network News Transfer Protocol|NNTP]] (the distributed data storage protocol used for [[Usenet]] news)
* Unity, of the software [[Perfect Dark (P2P)|Perfect Dark]]
* [[Share (P2P)|Share]]
* [[Siacoin]]
* DeNet
* [[Storage@home]]
* [[
* [[Winny]]
* [[ZeroNet]]
==See also==
* [[Cooperative storage cloud]]
* [[Data store]]
* [[Keyspace (distributed data store)|Keyspace]], the DDS [[Schema (database)|schema]]
* [[Distributed hash table]]
* [[Distributed cache]]
* [[Cyber Resilience]]
==References==
{{Reflist}}
[[Category:Data management]]
[[Category:Distributed data storage| ]]
[[Category:Distributed data stores| ]]
[[ja:分散ファイルシステム#分散データストア]]
|