The official home of the Presto distributed SQL query engine for big data
☆16,699 · Apr 27, 2026 · Updated this week
Alternatives and similar repositories for presto
Users that are interested in presto are comparing it to the libraries listed below.
- Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io) ☆12,744 · Updated this week
- Apache Druid: a high performance real-time analytics database. ☆13,980 · Updated this week
- Apache Flink ☆25,943 · Apr 21, 2026 · Updated last week
- Apache Calcite ☆5,112 · Updated this week
- A library that provides an embeddable, persistent key-value store for fast storage. ☆31,657 · Updated this week
- Alluxio, data orchestration for analytics and machine learning in the cloud ☆7,186 · Apr 29, 2025 · Updated 11 months ago
- ClickHouse® is a real-time analytics database management system ☆47,080 · Updated this week
- Apache Doris is an easy-to-use, high performance and unified analytics database. ☆15,266 · Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing ☆43,170 · Updated this week
- Upserts, Deletes And Incremental Processing on Big Data. ☆6,148 · Updated this week
- TiDB is built for agentic workloads that grow unpredictably, with ACID guarantees and native support for transactions, analytics, and vec… ☆39,998 · Updated this week
- Apache Iceberg ☆8,765 · Apr 21, 2026 · Updated last week
- Apache Pinot - A realtime distributed OLAP datastore ☆6,069 · Updated this week
- Apache Hive ☆5,975 · Updated this week
- Apache Pulsar - distributed pub-sub messaging system ☆15,214 · Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr… ☆8,759 · Updated this week
- Apache Kylin ☆3,766 · Updated this week
- Apache Superset is a Data Visualization and Data Exploration Platform ☆72,590 · Updated this week
- Web UI for PrestoDB. ☆2,750 · May 20, 2021 · Updated 4 years ago
- Apache Drill is a distributed MPP query layer for self describing data ☆2,014 · Apr 20, 2026 · Updated last week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows ☆45,144 · Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing. ☆8,553 · Updated this week
- CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placemen… ☆32,072 · Apr 21, 2026 · Updated last week
- Apache Kafka - A distributed event streaming platform ☆32,426 · Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more. ☆6,617 · Updated this week
- Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues. ☆12,654 · Updated this week
- A composable and fully extensible C++ execution engine library for data management systems. ☆4,102 · Updated this week
- Scalable datastore for metrics, events, and real-time analytics ☆31,452 · Updated this week
- Azkaban workflow manager. ☆4,511 · Jul 3, 2024 · Updated last year
- Zipkin is a distributed tracing system ☆17,432 · Apr 8, 2026 · Updated 2 weeks ago
- Mirror of Apache Kudu ☆1,900 · Updated this week
- Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code ☆14,244 · Updated this week
- The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly… ☆11,602 · Updated this week
- Real-time Query for Hadoop; mirror of Apache Impala ☆34 · Dec 27, 2022 · Updated 3 years ago
- Apache HBase ☆5,545 · Updated this week
- Free and Open Source, Distributed, RESTful Search Engine ☆76,569 · Updated this week
- The Prometheus monitoring system and time series database. ☆63,691 · Apr 21, 2026 · Updated last week
- Distributed reliable key-value store for the most critical data of a distributed system ☆51,635 · Updated this week
- Vitess is a database clustering system for horizontal scaling of MySQL. ☆20,928 · Updated this week