rewrite-bigdata-in-rust / RBIRLinks
A collection of RBIR projects and posts for anyone interested in joining this journey.
☆306Updated this week
Alternatives and similar repositories for RBIR
Users that are interested in RBIR are comparing it to the libraries listed below
Sorting:
- Distributed pushdown cache for DataFusion☆357Updated last week
- Apache DataFusion Ray☆228Updated 3 months ago
- TPC-H benchmark data generation in pure Rust☆218Updated this week
- Pure Rust Iceberg Implementation☆162Updated last year
- Apache Paimon Rust The rust implementation of Apache Paimon.☆137Updated 8 months ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆266Updated last week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆231Updated this week
- DataFusion TableProviders for reading data from other systems☆162Updated this week
- A native Delta implementation for integration with any query engine☆305Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆162Updated last month
- Message queue and data streaming based on cloud native services.☆115Updated last month
- CMU-DB's Cascades optimizer framework☆404Updated last year
- Apache Iceberg☆1,193Updated this week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆292Updated last week
- View parquet files online☆209Updated 2 weeks ago
- This is the companion repository for the book How Query Engines Work.☆418Updated last week
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- Batteries included CLI, TUI, and server implementations for DataFusion.☆185Updated last month
- A User-Defined Function Framework for Apache Arrow.☆109Updated 3 months ago
- Apache Spark Connect Client for Rust☆118Updated 7 months ago
- Incremental view maintenance & query rewriting for materialized views in DataFusion☆67Updated 2 weeks ago
- Compaction runtime for Apache Iceberg.☆113Updated this week
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads.☆1,110Updated this week
- ☆67Updated last week
- High-performance Stream Processing Framework. An alternative to Apache Flink.☆472Updated last year
- GlareDB: A light and fast SQL database for analytics☆986Updated last month
- JSON support for DataFusion (unofficial)☆55Updated 3 weeks ago
- Rust based high-performance Apache Uniffle shuffle-server☆55Updated last week
- Quickly view your data☆342Updated last week
- Sqllogictest (dialect with extensions) parser and runner in Rust.☆210Updated 3 weeks ago