rewrite-bigdata-in-rust / RBIR
A collection of RBIR projects and posts for anyone interested in joining this journey.
☆289 Updated this week
Alternatives and similar repositories for RBIR
Users who are interested in RBIR compare it to the repositories listed below
- Distributed pushdown cache for DataFusion ☆284 Updated this week
- Pure Rust Iceberg Implementation ☆162 Updated last year
- Apache DataFusion Ray ☆219 Updated last month
- TPC-H benchmark data generation in pure Rust ☆180 Updated 3 weeks ago
- Apache Paimon Rust: the Rust implementation of Apache Paimon. ☆130 Updated 5 months ago
- Unofficial Rust implementation of Apache Iceberg with integration for DataFusion ☆217 Updated last week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings. ☆253 Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible. ☆149 Updated 2 weeks ago
- DataFusion TableProviders for reading data from other systems ☆147 Updated last week
- Message queue and data streaming based on cloud native services. ☆115 Updated 5 months ago
- A native Delta implementation for integration with any query engine ☆271 Updated this week
- CMU-DB's Cascades optimizer framework ☆404 Updated 8 months ago
- Compaction runtime for Apache Iceberg. ☆87 Updated this week
- Apache Iceberg ☆1,089 Updated last week
- A User-Defined Function Framework for Apache Arrow. ☆104 Updated last month
- Batteries-included CLI, TUI, and server implementations for DataFusion. ☆165 Updated 3 months ago
- ☆59 Updated 2 weeks ago
- Rust-based high-performance Apache Uniffle shuffle-server ☆42 Updated last week
- Distributed SQL Query Engine in Python using Ray ☆244 Updated last year
- Simple & Real-Time Ingestion into Apache Iceberg. ☆176 Updated last week
- Learn Data Lake From Storage Layer. ☆45 Updated last year
- Sqllogictest (dialect with extensions) parser and runner in Rust. ☆204 Updated 3 weeks ago
- This is the companion repository for the book How Query Engines Work. ☆406 Updated 2 years ago
- JSON support for DataFusion (unofficial) ☆48 Updated 2 weeks ago
- Embeddable Aggregate Management System for Streams and Queries. ☆96 Updated 5 months ago
- View parquet files online ☆183 Updated last week
- A portable embedded database using Arrow. ☆1,183 Updated last week
- Apache Spark Connect Client for Rust ☆113 Updated 3 months ago
- A BYOC option for Snowflake workloads ☆101 Updated this week
- Quickly view your data ☆328 Updated this week