rewrite-bigdata-in-rust / RBIRLinks
A collection of RBIR projects and posts for anyone interested in joining this journey.
☆259Updated this week
Alternatives and similar repositories for RBIR
Users that are interested in RBIR are comparing it to the libraries listed below
Sorting:
- Distributed pushdown cache for DataFusion☆189Updated this week
- Pure Rust Iceberg Implementation☆163Updated 11 months ago
- Apache DataFusion Ray☆207Updated 3 months ago
- TPC-H benchmark data generation in pure Rust☆110Updated this week
- Apache Paimon Rust The rust implementation of Apache Paimon.☆128Updated 2 months ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆230Updated this week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆205Updated this week
- Message queue and data streaming based on cloud native services.☆112Updated 2 months ago
- DataFusion TableProviders for reading data from other systems☆129Updated last week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆137Updated this week
- A User-Defined Function Framework for Apache Arrow.☆97Updated last week
- CMU-DB's Cascades optimizer framework☆401Updated 6 months ago
- A native Delta implementation for integration with any query engine☆236Updated this week
- ☆53Updated this week
- Batteries included CLI, TUI, and server implementations for DataFusion.☆158Updated 2 weeks ago
- Apache Iceberg☆1,008Updated last week
- Distributed SQL Query Engine in Python using Ray☆243Updated 9 months ago
- Comptaction runtime for Apache Iceberg.☆47Updated this week
- This is the companion repository for the book How Query Engines Work.☆394Updated 2 years ago
- Learn Data Lake From Storage Layer.☆45Updated 11 months ago
- Sqllogictest (dialect with extensions) parser and runner in Rust.☆195Updated 3 weeks ago
- Embeddable Aggregate Management System for Streams and Queries.☆93Updated 2 months ago
- View parquet files online☆163Updated this week
- Apache Spark Connect Client for Rust☆109Updated last month
- OmniPaxos is a distributed log implemented as a Rust library.☆203Updated 4 months ago
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra☆71Updated last week
- Boring Data Tool☆224Updated last year
- High-performance Stream Processing Framework. An alternative to Apache Flink.☆464Updated last year
- ☆33Updated 2 months ago
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow☆362Updated 11 months ago