Druid is a column-oriented open-source distributed data store written in Java. Druid is designed to quickly ingest massive quantities of time-series data, making that data immediately available to queries.[1] This is sometimes referred to as real-time data.
Druid Database | |
---|---|
Developer(s) | Metamarkets |
Stable release | 0.6.52
/ 6 January 2014 |
Repository | |
Written in | Java |
Operating system | Cross-platform |
Type | Column-oriented data store |
License | GNU General Public License v2 |
Website | druid |
On the developer Q&A site Stackoverflow, Druid is described as "open-source infrastructure for real-time exploratory analytics on large datasets."[2] It is designed to ingest time-series data, chunking and compressing that data into column-based queryable segments.[3] Its architecture allows it to run as a cluster of specialized nodes to support a fault-tolerant architecture where data is stored redundantly and there are multiple members of each node type.[4]
History
Druid was created by real-time analytics company Metamarkets to use as a major part of its backend. The company open-sourced Druid in late 2012.[5] Since then, a number of organizations and companies, including Netflix[6], have integrated Druid into their backend technology.
References
- ^ Hemsoth, Nicole. "Druid Summons Strength in Real-Time", datanami, 08 November 2012
- ^ [1]Stackoverflow shorthand tag description.
- ^ Monash, Curt. "Metamarkets Druid Overview", DBMS2, 16 June 2012
- ^ Yang, Fangjin; Tschetter, Eric; Merlino, Gian; Ray, Nelson; Léauté, Xavier; Ganguli, Deep; Singh, Himadri. "Druid: A Real-time Analytical Data Store", Metamarkets, retrieved 6 February 2014
- ^ Higginbotham, Stacey. "Metamarkets open sources Druid, its in-memory database", GigaOM, 24 October 2012
- ^ Bae, Jae Hyeon; Yuan, Danny; Tonse, Sudhir. "Announcing Suro: Backbone of Netflix's Data Pipeline", Netflix, 9 December 2013
External Links