apache / datafusion
Apache DataFusion SQL Query Engine
☆6,312Updated this week
Related projects ⓘ
Alternatives and complementary repositories for datafusion
- Official Rust implementation of Apache Arrow☆2,606Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,549Updated this week
- Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time E…☆7,052Updated this week
- The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.☆5,809Updated this week
- A native Rust library for Delta Lake, with bindings into Python☆2,325Updated this week
- Distributed stream processing engine in Rust☆3,794Updated this week
- Extensible SQL Lexer and Parser for Rust☆2,798Updated this week
- 𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://data…☆7,867Updated this week
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v…☆3,964Updated this week
- Apache OpenDAL: One Layer, All Storage.☆3,445Updated this week
- A modular implementation of timely dataflow in Rust☆3,299Updated last week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,205Updated this week
- A new arguably faster implementation of Apache Spark from scratch in Rust☆2,233Updated 2 years ago
- Transmute-free Rust library to work with the Arrow format☆1,063Updated 8 months ago
- A composable and fully extensible C++ execution engine library for data management systems.☆3,520Updated this week
- Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database.☆2,640Updated this week
- Build Postgres Extensions with Rust!☆3,688Updated this week
- TensorBase is a new big data warehousing with modern efforts.☆1,442Updated 2 years ago
- An implementation of differential dataflow using timely dataflow on Rust.☆2,586Updated last week
- Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.☆1,292Updated this week
- Fast web applications through dynamic, partially-stateful dataflow☆5,003Updated 3 years ago
- High-performance runtime for data analytics applications☆2,996Updated 2 years ago
- Apache DataFusion Comet Spark Accelerator☆821Updated this week
- Apache Iceberg☆658Updated this week
- An open-source, cloud-native, unified time series database for metrics, logs and events with SQL/PromQL supported. Available on GreptimeC…☆4,341Updated this week
- GlueSQL is quite sticky. It attaches to anywhere.☆2,726Updated last week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆14,615Updated this week
- the champagne of beta embedded databases☆8,175Updated last month