apache / pinotLinks
Apache Pinot - A realtime distributed OLAP datastore
☆6,018Updated this week
Alternatives and similar repositories for pinot
Users that are interested in pinot are comparing it to the libraries listed below
Sorting:
- Apache Iceberg☆8,485Updated this week
- Apache Druid: a high performance real-time analytics database.☆13,928Updated this week
- Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)☆12,485Updated this week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Updated 2 weeks ago
- The official home of the Presto distributed SQL query engine for big data☆16,637Updated this week
- Upserts, Deletes And Incremental Processing on Big Data.☆6,082Updated last week
- Apache Drill is a distributed MPP query layer for self describing data☆2,006Updated this week
- Apache Parquet Java☆3,016Updated last week
- Apache Calcite☆5,062Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,560Updated this week
- Apache Kylin☆3,767Updated last month
- CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex…☆4,357Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,465Updated this week
- Alluxio, data orchestration for analytics and machine learning in the cloud☆7,147Updated 9 months ago
- Apache Parquet Format☆2,203Updated 2 weeks ago
- ☆1,686Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,607Updated this week
- Apache Avro is a data serialization system.☆3,214Updated this week
- Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.☆12,333Updated this week
- Dremio - the missing link in modern data☆1,464Updated 4 months ago
- Mirror of Apache Kudu☆1,898Updated this week
- Apache Geode☆2,351Updated last week
- Apache Hive☆6,000Updated this week
- Apache Impala☆1,265Updated last week
- A composable and fully extensible C++ execution engine library for data management systems.☆4,037Updated this week
- An extensible distributed system for reliable nearline data streaming at scale☆951Updated 2 months ago
- The live data layer for apps and AI agents Create up-to-the-second views into your business, just using SQL☆6,220Updated this week
- Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides…☆2,986Updated 2 months ago
- Web UI for PrestoDB.☆2,752Updated 4 years ago
- Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!☆11,625Updated this week