apache / datafusion
Apache DataFusion SQL Query Engine
★7,709 (updated this week)
Alternatives and similar repositories for datafusion
Users who are interested in datafusion are comparing it to the libraries listed below.
- Official Rust implementation of Apache Arrow (★3,121, updated this week)
- AI-Native Data Warehouse. Open-source Snowflake alternative. Proven at petabyte scale with enterprise performance. B… (★8,784, updated last week)
- Apache DataFusion Ballista Distributed Query Engine (★1,836, updated last week)
- Real-time event streaming platform. Streaming CDC, stream processing, low-latency serving, and Iceberg management. (★8,316, updated this week)
- Distributed stream processing engine in Rust (★4,501, updated last week)
- Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data. (★6,108, updated this week)
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v… (★5,320, updated last week)
- Extensible SQL Lexer and Parser for Rust (★3,178, updated this week)
- A native Rust library for Delta Lake, with bindings into Python (★2,938, updated this week)
- Apache OpenDAL: One Layer, All Storage. (★4,411, updated this week)
- A composable and fully extensible C++ execution engine library for data management systems. (★3,883, updated this week)
- A modular implementation of timely dataflow in Rust (★3,504, updated 2 weeks ago)
- Apache Iceberg (★1,078, updated this week)
- Build Postgres Extensions with Rust! (★4,150, updated 3 weeks ago)
- TensorBase is a new big data warehousing with modern efforts. (★1,453, updated 3 years ago)
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process… (★1,567, updated this week)
- A new arguably faster implementation of Apache Spark from scratch in Rust (★2,241, updated 3 years ago)
- Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database. (★2,780, updated 2 weeks ago)
- Open-source, cloud-native, unified observability database for metrics, logs and traces, supporting SQL/PromQL/Streaming. Available on Gre… (★5,514, updated this week)
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. (★1,380, updated last week)
- Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust (★13,706, updated last week)
- Distributed SQL database in Rust, written as an educational project (★7,057, updated 3 weeks ago)
- Distributed query engine providing simple and reliable data processing for any modality and scale (★3,750, updated this week)
- Transmute-free Rust library to work with the Arrow format (★1,063, updated last year)
- An implementation of differential dataflow using timely dataflow on Rust. (★2,798, updated 2 weeks ago)
- Fastest library to load data from DB to DataFrames in Rust and Python (★2,403, updated 2 weeks ago)
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics (★15,929, updated this week)
- Fast web applications through dynamic, partially-stateful dataflow (★5,170, updated 3 years ago)
- Apache DataFusion Comet Spark Accelerator (★1,033, updated this week)
- the champagne of beta embedded databases (★8,672, updated 3 months ago)