rewrite-bigdata-in-rust / RBIRLinks
A collection of RBIR projects and posts for anyone interested in joining this journey.
☆278Updated this week
Alternatives and similar repositories for RBIR
Users that are interested in RBIR are comparing it to the libraries listed below
Sorting:
- Distributed pushdown cache for DataFusion☆245Updated this week
- Pure Rust Iceberg Implementation☆162Updated last year
- Apache DataFusion Ray☆217Updated 2 weeks ago
- Apache Paimon Rust The rust implementation of Apache Paimon.☆129Updated 4 months ago
- TPC-H benchmark data generation in pure Rust☆131Updated this week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆214Updated last week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆247Updated this week
- DataFusion TableProviders for reading data from other systems☆136Updated this week
- Message queue and data streaming based on cloud native services.☆113Updated 4 months ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆141Updated 2 weeks ago
- CMU-DB's Cascades optimizer framework☆404Updated 7 months ago
- Apache Iceberg☆1,059Updated this week
- A native Delta implementation for integration with any query engine☆246Updated last week
- A User-Defined Function Framework for Apache Arrow.☆102Updated 2 weeks ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆163Updated 2 months ago
- This is the companion repository for the book How Query Engines Work.☆397Updated 2 years ago
- Distributed SQL Query Engine in Python using Ray☆244Updated 10 months ago
- High-performance Stream Processing Framework. An alternative to Apache Flink.☆467Updated last year
- Sqllogictest (dialect with extensions) parser and runner in Rust.☆199Updated 2 months ago
- View parquet files online☆172Updated 3 weeks ago
- Comptaction runtime for Apache Iceberg.☆67Updated this week
- ☆56Updated this week
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra☆73Updated last week
- Sub-Second Postgres to Iceberg Mirroring☆124Updated this week
- New file format for storage of large columnar datasets.☆586Updated 2 weeks ago
- A portable embedded database using Arrow.☆1,136Updated last week
- Rust based high-performance Apache Uniffle shuffle-server☆39Updated this week
- High performance distributed cache system. Built by Rust.☆312Updated this week
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow☆365Updated last year
- Hybrid in-memory and disk cache in Rust☆1,111Updated this week