⚡ Fastest SQL ETL pipeline in a single C++ binary, built for stream processing, observability, analytics and AI/ML
☆2,174Mar 20, 2026Updated this week
Alternatives and similar repositories for proton
Users that are interested in proton are comparing it to the libraries listed below
Sorting:
- Distributed stream processing engine in Rust☆4,841Mar 12, 2026Updated last week
- Event streaming platform for agents, apps, and analytics. Continuously ingest, transform, and serve event data in real time, at scale.☆8,864Updated this week
- chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse☆2,641Updated this week
- Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!☆11,867Updated this week
- The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL☆6,251Updated this week
- GlareDB: A light and fast SQL database for analytics☆1,005Nov 14, 2025Updated 4 months ago
- A cloud native embedded storage engine built on object storage.☆2,793Updated this week
- Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.☆9,201Updated this week
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,174Updated this week
- Apache DataFusion SQL Query Engine☆8,516Updated this week
- Simple, Elastic-quality search for Postgres☆8,555Updated this week
- Python Stream Processing☆1,960Mar 27, 2025Updated 11 months ago
- DuckDB for streaming data☆749Sep 4, 2025Updated 6 months ago
- 🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊☆892Updated this week
- Postgres-native columnar storage extension☆3,018Feb 10, 2025Updated last year
- Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage☆3,026Updated this week
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,730Updated this week
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale☆5,311Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,993Updated this week
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Li…☆2,801Updated this week
- A composable and fully extensible C++ execution engine library for data management systems.☆4,079Updated this week
- Apache OpenDAL: One Layer, All Storage.☆4,951Updated this week
- Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.☆10,983Mar 13, 2026Updated last week
- Real-time analytics on Postgres tables☆1,941Dec 3, 2025Updated 3 months ago
- stream data generator☆15Jul 5, 2024Updated last year
- The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly…☆11,488Updated this week
- Apache Fluss is a streaming storage built for real-time analytics.☆1,826Updated this week
- Analytical database for data-driven Web applications 🪶☆511Feb 25, 2025Updated last year
- Tonbo is an embedded database for serverless and edge runtimes.☆1,504Mar 6, 2026Updated 2 weeks ago
- Embeddable stream processing engine based on Apache DataFusion☆374Dec 18, 2024Updated last year
- The open-source Observability 2.0 database. One engine for metrics, logs, and traces — replacing Prometheus, Loki & ES.☆6,045Updated this week
- DuckDB-powered Postgres for high performance apps & analytics.☆3,003Updated this week
- Hybrid in-memory and disk cache in Rust☆1,651Mar 5, 2026Updated 2 weeks ago
- 🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.☆5,182Mar 11, 2026Updated last week
- Fancy stream processing made operationally mundane☆8,609Updated this week
- GigAPI is a Timeseries lakehouse for real-time data and sub-second queries, powered by DuckDB OLAP + Parquet Query Engine, Compactor w/ C…☆383Oct 20, 2025Updated 5 months ago
- Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …☆3,213Mar 14, 2026Updated last week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,530Updated this week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,957Updated this week