rewrite-bigdata-in-rust / RBIRLinks
A collection of RBIR projects and posts for anyone interested in joining this journey.
☆293Updated this week
Alternatives and similar repositories for RBIR
Users that are interested in RBIR are comparing it to the libraries listed below
Sorting:
- Distributed pushdown cache for DataFusion☆315Updated this week
- Apache DataFusion Ray☆222Updated last month
- Pure Rust Iceberg Implementation☆161Updated last year
- TPC-H benchmark data generation in pure Rust☆205Updated last week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆258Updated last week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆226Updated last week
- Apache Paimon Rust The rust implementation of Apache Paimon.☆132Updated 6 months ago
- DataFusion TableProviders for reading data from other systems☆157Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆155Updated 2 weeks ago
- A native Delta implementation for integration with any query engine☆277Updated this week
- CMU-DB's Cascades optimizer framework☆404Updated 10 months ago
- Apache Iceberg☆1,126Updated last week
- Message queue and data streaming based on cloud native services.☆115Updated 2 weeks ago
- Compaction runtime for Apache Iceberg.☆106Updated last week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆237Updated last week
- A User-Defined Function Framework for Apache Arrow.☆108Updated last month
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- This is the companion repository for the book How Query Engines Work.☆410Updated 2 years ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆177Updated this week
- View parquet files online☆197Updated this week
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads.☆1,054Updated this week
- High-performance Stream Processing Framework. An alternative to Apache Flink.☆471Updated last year
- Rust based high-performance Apache Uniffle shuffle-server☆42Updated this week
- JSON support for DataFusion (unofficial)☆48Updated this week
- Quickly view your data☆332Updated last week
- ☆59Updated 3 weeks ago
- Embeddable Aggregate Management System for Streams and Queries.☆97Updated 6 months ago
- A portable embedded database using Arrow.☆1,200Updated last week
- Apache DataFusion Comet Spark Accelerator☆1,065Updated this week
- Learn Data Lake From Storage Layer.☆45Updated last year