{{Short description|Open-source distributed data store}}
{{Infobox software
| name = Apache Pinot
| logo = [[File:
| screenshot =
| caption =
| author = {{ubl|Kishore Gopalakrishna|Xiang Fu}}
| developer = Apache Pinot
| latest release version =
| latest release date = {{Start date and age|df=yes|
| repo = [https://
| programming language = [[Java (programming language)|Java]]
| operating system = [[Cross-platform]]
}}
'''Apache Pinot''' is a [[Column-oriented DBMS|column-oriented]], [[open-source software|open-source]], [[Distributed database|distributed]] [[data store]] written in [[Java (programming language)|Java]]. Pinot is designed to execute [[Online analytical processing|OLAP]] queries with low latency.<ref>{{cite
Pinot was first created at [[LinkedIn]] after the engineering staff determined that no off-the-shelf solutions met the social networking site's requirements, such as predictable low latency, data freshness in seconds, fault tolerance, and scalability.<ref name="open-sourcing-pinot" />
== History ==
Pinot was started as an internal project at LinkedIn in 2013 to power a variety of user-facing and business-facing products. The first analytics product at LinkedIn to use Pinot was a redesign of the feature that lets members see, in real time, who has viewed their profile. The project was open-sourced in June 2015 under an Apache 2.0 license, and LinkedIn donated it to the Apache Software Foundation in June 2019.<ref name="open-sourcing-pinot" /><ref name="pinot-joins-apache-foundation" />
== Architecture ==
[[File:Pinot Architecture.png|520x520px|thumb|alt=Architecture of Apache Pinot|Architecture diagram of Apache Pinot]]
Pinot uses [[Apache Helix]] for cluster management. Helix is a generic cluster management framework for managing partitions and replicas in a distributed system. It is embedded as an agent within the different Pinot components and uses [[Apache ZooKeeper]] for coordination and for maintaining the overall cluster state and health. All Pinot servers and brokers are managed by Helix.
=== Query management ===
== Features ==
Pinot shares features with comparable OLAP datastores, such as [[Apache Druid]].<ref>{{cite book |last1=Ordonez |first1=Carlos |last2=Song |first2=Il-Yeol |last3=Anderst-Kotsis |first3=Gabriele |last4=Tjoa |first4=A. Min |last5=Khalil |first5=Ismail |title=Big Data Analytics and Knowledge Discovery: 21st International Conference, DaWaK 2019, Linz, Austria, August 26–29, 2019, Proceedings |date=2 October 2019 |publisher=Springer |isbn=978-3-030-27520-4 |page=170 |url=https://books.google.com/books?id=sf-pDwAAQBAJ&dq=Pinot+(data+store)+-wikipedia&pg=PA170 |language=en}}</ref><ref>{{cite book |last1=Uttamchandani |first1=Sandeep |title=The Self-Service Data Roadmap |date=10 September 2020 |publisher="O'Reilly Media, Inc." |isbn=978-1-4920-7520-2 |url=https://books.google.com/books?id=pEn8DwAAQBAJ&dq=Pinot+(data+store)+-wikipedia&pg=PT72 |language=en}}</ref> Like Druid, Pinot is a column-oriented database with various compression schemes such as [[
Pinot supports near real-time ingestion from streams such as [[Apache Kafka|Kafka]] and [[AWS]] Kinesis, and [[Batch processing|batch]] ingestion from sources such as [[Hadoop]], [[Amazon S3|S3]], [[Microsoft Azure|Azure]], and [[Google Cloud Storage|GCS]]. Like
== See also ==
{{Portal|Free and open-source software}}
* [[List of column-oriented DBMSes]]
* [[Comparison of OLAP servers]]
== References ==
{{Reflist|30em}}
== External links ==
[[Category:Structured storage]]
[[Category:Free database management systems]]
[[Category:Free software programmed in Java (programming language)]]
[[Category:Database engines]]
[[Category:Big data products]]