rewrite-bigdata-in-rust/RBIR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rewrite-bigdata-in-rust/RBIR)

rewrite-bigdata-in-rust / RBIR

A collection of RBIR projects and posts for anyone interested in joining this journey.

☆327

Alternatives and similar repositories for RBIR

Users that are interested in RBIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Xuanwo / learn-data-lake-from-storage
View on GitHub
Learn Data Lake From Storage Layer.
☆44Aug 4, 2024Updated last year
apache / paimon-rust
View on GitHub
Apache Paimon Rust The rust implementation of Apache Paimon.
☆187Updated this week
apache / iceberg-rust
View on GitHub
Apache Iceberg
☆1,361Updated this week
arrow-udf / arrow-udf
View on GitHub
A User-Defined Function Framework for Apache Arrow.
☆114Updated this week
tisonkun / morax
View on GitHub
Message queue and data streaming based on cloud native services.
☆116Dec 1, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tonbo-io / tonbo
View on GitHub
Tonbo is an embedded database for serverless and edge runtimes.
☆1,592Jul 18, 2026Updated last week
apache / auron
View on GitHub
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…
☆1,780Updated this week
apache / datafusion-ray
View on GitHub
Apache DataFusion Ray
☆230May 15, 2026Updated 2 months ago
slatedb / slatedb
View on GitHub
A cloud native embedded storage engine built on object storage.
☆3,239Updated this week
JasonLi-cn / databend-comment
View on GitHub
databend source reading notes
☆22Jan 27, 2023Updated 3 years ago
apache / datafusion
View on GitHub
Apache DataFusion SQL Query Engine
☆9,041Updated this week
GlareDB / glaredb
View on GitHub
GlareDB: A light and fast SQL database for analytics
☆1,019Nov 14, 2025Updated 8 months ago
databendlabs / databend-docs
View on GitHub
Official repository for Databend documentation
☆17Updated this week
apache / datafusion-comet
View on GitHub
Apache DataFusion Comet Spark Accelerator
☆1,233Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
icelake-io / icelake
View on GitHub
Pure Rust Iceberg Implementation
☆162Aug 13, 2024Updated last year
apache / datafusion-ballista
View on GitHub
Apache DataFusion Ballista Distributed Query Engine
☆2,096Updated this week
cmu-db / optd-original
View on GitHub
CMU-DB's Cascades optimizer framework
☆405Jan 6, 2025Updated last year
foyer-rs / foyer
View on GitHub
Hybrid in-memory and disk cache in Rust
☆1,780Jul 11, 2026Updated 2 weeks ago
datafusion-contrib / ray-sql
View on GitHub
Distributed SQL Query Engine in Python using Ray
☆245Oct 2, 2024Updated last year
databendlabs / snowtree
View on GitHub
Review-Driven Safe AI Coding
☆55Apr 15, 2026Updated 3 months ago
databendlabs / databend
View on GitHub
Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.
☆9,402Updated this week
apache / opendal
View on GitHub
Apache OpenDAL: One Layer, All Storage.
☆5,268Updated this week
XiangpengHao / liquid-cache
View on GitHub
Pushdown cache for DataFusion
☆418Jun 13, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
crabml / crabml
View on GitHub
a fast cross platform AI inference engine 🤖 using Rust 🦀 and WebGPU 🎮
☆470Jan 4, 2025Updated last year
lakekeeper / lakekeeper
View on GitHub
Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
☆1,399Updated this week
vortex-data / vortex
View on GitHub
An extensible, state-of-the-art framework for columnar compression, and the fastest FOSS columnar file format. Formerly at @spiraldb, now…
☆3,105Updated this week
usamoi / saha
View on GitHub
OSPP 2022 Project: String Adaptive Hash Table for Databend
☆19Sep 15, 2022Updated 3 years ago
sjrusso8 / spark-connect-rs
View on GitHub
Apache Spark Connect Client for Rust
☆116Jun 10, 2025Updated last year
sundy-li / strawboat
View on GitHub
A native storage format for apache arrow
☆83Oct 18, 2023Updated 2 years ago
risingwavelabs / risingwave
View on GitHub
Event streaming platform for agentic AI. Continuously ingest, transform, and serve event streams in real time, at scale.
☆9,195Updated this week
silentsokolov / dbt-databend
View on GitHub
The Databend plugin for dbt (data build tool)
☆12Mar 17, 2023Updated 3 years ago
ArroyoSystems / arroyo
View on GitHub
Distributed stream processing engine in Rust
☆4,975Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
substrait-io / substrait
View on GitHub
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,537Updated this week
apache / hudi-rs
View on GitHub
The native Rust implementation for Apache Hudi, with C++ & Python API bindings.
☆278Jun 26, 2026Updated last month
apache / arrow-rs
View on GitHub
Official Rust implementation of Apache Arrow
☆3,551Updated this week
risinglightdb / risinglight
View on GitHub
An educational OLAP database system.
☆1,836Aug 10, 2025Updated 11 months ago
facebookincubator / nimble
View on GitHub
New and extensible file format for storage of large columnar datasets.
☆728Updated this week
spacewalkhq / raft-rs
View on GitHub
An understandable, fast and scalable Raft Consensus implementation
☆146Feb 9, 2026Updated 5 months ago
clflushopt / tpchgen-rs
View on GitHub
TPC-H benchmark data generation in pure Rust
☆252Updated this week