Apache Druid

This is an old revision of this page, as edited by Textractor (talk | contribs) at 18:37, 12 February 2014. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.


Druid is a column-oriented open-source distributed data store written in Java. Druid is designed to quickly ingest massive quantities of time-series data, making that data immediately available to queries.[1] This is sometimes referred to as real-time data.

Druid Database
Developer(s)Metamarkets
Stable release
0.6.52 / 6 January 2014 (2014-01-06)
Repository
Written inJava
Operating systemCross-platform
TypeColumn-oriented data store
LicenseGNU General Public License v2
Websitedruid.io

On the developer Q&A site Stackoverflow, Druid is described as "open-source infrastructure for real-time exploratory analytics on large datasets."[2] It is designed to ingest time-series data, chunking and compressing that data into column-based queryable segments.[3] Its architecture allows it to run as a cluster of specialized nodes to support a fault-tolerant architecture where data is stored redundantly and there are multiple members of each node type.[4]

History

Druid was created by real-time analytics company Metamarkets to use as a major part of its backend. The company open-sourced Druid in late 2012.[5] Since then, a number of organizations and companies, including Netflix[6], have integrated Druid into their backend technology.

References

  1. ^ Hemsoth, Nicole. "Druid Summons Strength in Real-Time", datanami, 08 November 2012
  2. ^ [1]Stackoverflow shorthand tag description.
  3. ^ Monash, Curt. "Metamarkets Druid Overview", DBMS2, 16 June 2012
  4. ^ Yang, Fangjin; Tschetter, Eric; Merlino, Gian; Ray, Nelson; Léauté, Xavier; Ganguli, Deep; Singh, Himadri. "Druid: A Real-time Analytical Data Store", Metamarkets, retrieved 6 February 2014
  5. ^ Higginbotham, Stacey. "Metamarkets open sources Druid, its in-memory database", GigaOM, 24 October 2012
  6. ^ Bae, Jae Hyeon; Yuan, Danny; Tonse, Sudhir. "Announcing Suro: Backbone of Netflix's Data Pipeline", Netflix, 9 December 2013