rewrite-bigdata-in-rust / RBIRLinks
A collection of RBIR projects and posts for anyone interested in joining this journey.
☆256Updated this week
Alternatives and similar repositories for RBIR
Users that are interested in RBIR are comparing it to the libraries listed below
Sorting:
- Pure Rust Iceberg Implementation☆163Updated 10 months ago
- Distributed pushdown cache for DataFusion☆181Updated this week
- Apache DataFusion Ray☆202Updated 2 months ago
- DataFusion TableProviders for reading data from other systems☆125Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆134Updated last month
- Apache Paimon Rust The rust implementation of Apache Paimon.☆126Updated 2 months ago
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆200Updated this week
- Message queue and data streaming based on cloud native services.☆111Updated 2 months ago
- A native Delta implementation for integration with any query engine☆234Updated this week
- TPC-H benchmark data generation in pure Rust☆91Updated 2 weeks ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆225Updated this week
- Batteries included CLI, TUI, and server implementations for DataFusion.☆156Updated last month
- CMU-DB's Cascades optimizer framework☆400Updated 5 months ago
- Apache Iceberg☆990Updated this week
- Learn Data Lake From Storage Layer.☆45Updated 10 months ago
- Sqllogictest (dialect with extensions) parser and runner in Rust.☆194Updated 2 weeks ago
- View parquet files online☆159Updated this week
- Apache Spark Connect Client for Rust☆109Updated last week
- A User-Defined Function Framework for Apache Arrow.☆94Updated 2 weeks ago
- Boring Data Tool☆223Updated last year
- Embeddable Aggregate Management System for Streams and Queries.☆92Updated 2 months ago
- JSON support for DataFusion (unofficial)☆42Updated last week
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow☆361Updated 10 months ago
- Distributed SQL Query Engine in Python using Ray☆243Updated 8 months ago
- New file format for storage of large columnar datasets.☆558Updated this week
- Rust based high-performance Apache Uniffle shuffle-server☆35Updated this week
- Shared Unit Raft☆82Updated 6 months ago
- High-performance Stream Processing Framework. An alternative to Apache Flink.☆464Updated last year
- Implementation of Apache ORC file format use Apache Arrow in-memory format☆43Updated 5 months ago
- Analytical database for data-driven Web applications 🪶☆484Updated 3 months ago