Apache Pinot

}}
 
'''Pinot''' is a [[Column-oriented DBMS|column-oriented]], [[open-source software|open-source]], [[Distributed database|distributed]] [[data store]] written in [[Java (programming language)|Java]]. Pinot is designed to execute OLAP queries with low latency.<ref>{{cite journal |last1=Im |first1=Jean-François |last2=Gopalakrishna |first2=Kishore |last3=Subramaniam |first3=Subbu |last4=Shrivastava |first4=Mayank |last5=Tumbde |first5=Adwait |last6=Jiang |first6=Xiaotian |last7=Dai |first7=Jennifer |last8=Lee |first8=Seunghyun |last9=Pawar |first9=Neha |last10=Li |first10=Jialiang |last11=Aringunram |first11=Ravi |title=Pinot: Realtime OLAP for 530 Million Users |journal=Proceedings of the 2018 International Conference on Management of Data |date=2018-05-27 |pages=583–594 |doi=10.1145/3183713.3190661 |url=https://dl.acm.org/doi/abs/10.1145/3183713.3190661#d13801648e1 |publisher=Association for Computing Machinery |s2cid=44083085}}</ref> It is suited to contexts where fast analytics, such as aggregations, are needed on immutable data, possibly with real-time data ingestion.<ref name="pinot-joins-apache-foundation">Pawar, Neha. [https://engineering.linkedin.com/blog/2019/03/pinot-joins-apache-incubator "Pinot Joins Apache Incubator"] {{Webarchive|url=https://web.archive.org/web/20190402090136/https://engineering.linkedin.com/blog/2019/03/pinot-joins-apache-incubator |date=2019-04-02 }}, ''LinkedIn Engineering'', 1 April 2019</ref> The name Pinot comes from the [[Pinot grape]], which is pressed into juice that is used to produce a variety of different wines.
The founders of the database chose the name as a metaphor for analyzing vast quantities of data from a variety of different file formats or streaming data sources.<ref name="open-sourcing-pinot">{{cite web |last1=Gopalakrishna |first1=Kishore |title=Open Sourcing Pinot: Scaling the Wall of Real-Time Analytics |url=https://engineering.linkedin.com/pinot/open-sourcing-pinot-scaling-wall-real-time-analytics |website=engineering.linkedin.com |publisher=LinkedIn |accessdate=3 September 2020 |archiveurl=https://web.archive.org/web/20150910081445/http://engineering.linkedin.com/pinot/open-sourcing-pinot-scaling-wall-real-time-analytics |archivedate=10 September 2015 |language=en}}</ref>
 
Pinot was first created at [[LinkedIn]] after the engineering staff determined that there were no off-the-shelf solutions that met the social networking site's requirements, such as predictable low latency, data freshness in seconds, fault tolerance, and scalability.<ref name="open-sourcing-pinot" /> Pinot is used in production by technology companies such as [[Uber]],<ref name="pinot-at-uber">{{cite web |last1=Wang |first1=Haibo |title=Engineering SQL Support on Apache Pinot at Uber |url=https://eng.uber.com/engineering-sql-support-on-apache-pinot/ |website=Uber Engineering Blog |publisher=Uber |accessdate=3 September 2020 |date=15 January 2020}}</ref> [[Microsoft]],<ref name="pinot-joins-apache-foundation" /> and [[Factual]].<ref name="pinot-at-factual">{{cite web |last1=Melz |first1=Eric |title=Pinot @ Factual |url=https://medium.com/@ericmelz/pinot-factual-6118ebaf64b3 |website=Medium |accessdate=3 September 2020 |archiveurl=https://web.archive.org/web/20200903081722/https://medium.com/@ericmelz/pinot-factual-6118ebaf64b3 |archivedate=3 September 2020 |language=en |date=17 May 2020}}</ref>