rewrite-bigdata-in-rust / RBIR
A collection of RBIR projects and posts for anyone interested in joining this journey.
☆223Updated this week
Alternatives and similar repositories for RBIR:
Users that are interested in RBIR are comparing it to the libraries listed below
- Apache DataFusion Ray☆160Updated this week
- Pure Rust Iceberg Implementation☆164Updated 6 months ago
- Rust implementation of Apache Iceberg with integration for Datafusion☆147Updated this week
- Apache Paimon Rust The rust implementation of Apache Paimon.☆110Updated 4 months ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆96Updated this week
- A collection of demos showcasing how stream processing can be used to solve real-world problems.☆100Updated last week
- Message queue and data streaming based on cloud native services.☆104Updated last month
- The native Rust implementation for Apache Hudi, with Python API bindings.☆194Updated last week
- Apache Iceberg☆825Updated this week
- CMU-DB's Cascades optimizer framework☆397Updated last month
- A User-Defined Function Framework for Apache Arrow.☆86Updated last week
- Sqllogictest (dialect with extensions) parser and runner in Rust.☆188Updated this week
- An opinionated and batteries included DataFusion implementation.☆133Updated 2 weeks ago
- Embeddable Aggregate Management System for Streams and Queries.☆91Updated last month
- Distributed SQL Query Engine in Python using Ray☆243Updated 4 months ago
- Learn Data Lake From Storage Layer.☆45Updated 6 months ago
- DataFusion TableProviders for reading data from other systems☆81Updated this week
- A native Delta implementation for integration with any query engine☆188Updated this week
- Apache Spark Connect Client for Rust☆102Updated 2 weeks ago
- Boring Data Tool☆213Updated 11 months ago
- Shared Unit Raft☆80Updated 2 months ago
- A native storage format for apache arrow☆82Updated last year
- ☆42Updated this week
- This is the companion repository for the book How Query Engines Work.☆384Updated last year
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra☆63Updated this week
- ☆33Updated 2 years ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆76Updated 4 months ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆236Updated 9 months ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆119Updated last month
- High-performance Stream Processing Framework. An alternative to Apache Flink.☆445Updated last year