Library for bringing distributed capabilities to Apache DataFusion
☆69 · Updated this week
Alternatives and similar repositories for datafusion-distributed
Users who are interested in datafusion-distributed are comparing it to the libraries listed below
- An experimental (work-in-progress) statically typed implementation of Apache Arrow ☆28 · Feb 16, 2026 · Updated last week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible. ☆166 · Jan 14, 2026 · Updated last month
- Python package for ducklake ☆20 · Jun 5, 2025 · Updated 8 months ago
- Data engineering framework written in Python, based on Polars. ☆14 · May 1, 2024 · Updated last year
- Tantivy directory implementation backed by object_store ☆40 · Jan 22, 2024 · Updated 2 years ago
- A collection of resources about DataFusion ☆17 · Nov 11, 2024 · Updated last year
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings. ☆269 · Updated this week
- ☆15 · Nov 10, 2025 · Updated 3 months ago
- Apache DataFusion Benchmarks ☆23 · Dec 31, 2025 · Updated 2 months ago
- Apache Arrow database client for many databases. ☆49 · Jan 20, 2026 · Updated last month
- Prototype implementation of Zarr v3 in Rust ☆18 · Nov 20, 2023 · Updated 2 years ago
- Rust object_store crate ☆226 · Updated this week
- Rust DataFusion Server ☆25 · Feb 4, 2026 · Updated 3 weeks ago
- Incremental view maintenance & query rewriting for materialized views in DataFusion ☆69 · Feb 3, 2026 · Updated 3 weeks ago
- Embeddable Aggregate Management System for Streams and Queries. ☆107 · Nov 8, 2025 · Updated 3 months ago
- InfluxData's core functionality for InfluxDB Edge and IOx ☆50 · Jan 28, 2026 · Updated last month
- Building block library for using Apache Arrow in Rust WebAssembly modules. ☆28 · Feb 19, 2026 · Updated last week
- Integration between arrow-rs and extendr ☆26 · Dec 15, 2025 · Updated 2 months ago
- Implementation of Zarr file format in Rust ☆29 · Updated this week
- Read & decompress many chunks of files at high speed ☆67 · May 5, 2025 · Updated 9 months ago
- Rust implementation of the FastLanes compression library ☆163 · Updated this week
- Framework to build data pipelines declaratively ☆94 · Dec 6, 2025 · Updated 2 months ago
- DataFusion TableProviders for reading data from other systems ☆170 · Updated this week
- A Reproducible Untargeted Metabolomics Data Processing Pipeline ☆11 · Mar 18, 2021 · Updated 4 years ago
- AIO Wialon is an async implementation of a Python wrapper for the Remote API ☆12 · Sep 16, 2025 · Updated 5 months ago
- µWheel DataFusion Optimizer for speeding up time-based analytics ☆40 · Apr 4, 2025 · Updated 10 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆86 · Sep 30, 2024 · Updated last year
- Benchmarking library for stable Rust ☆47 · Dec 21, 2025 · Updated 2 months ago
- Apache Paimon Rust: the Rust implementation of Apache Paimon. ☆144 · Feb 12, 2026 · Updated 2 weeks ago
- ☆11 · Dec 2, 2024 · Updated last year
- Rust source code for all 650 LeetCode hard algorithmic problems available with no subscription ☆14 · Jul 6, 2025 · Updated 7 months ago
- Schema-aware JSON compression with millisecond lookups: cut transfer/storage while enabling exists*/pos* queries. (Demo + wheels; core i… ☆24 · Feb 21, 2026 · Updated last week
- A rio-tiler plugin to create tiles using TMS (WebMercator or others) ☆11 · Oct 13, 2020 · Updated 5 years ago
- Distributed pushdown cache for DataFusion ☆385 · Feb 21, 2026 · Updated last week
- Lakekeeper is an Apache-licensed, secure, fast, and easy-to-use Apache Iceberg REST Catalog written in Rust. ☆1,196 · Updated this week
- Rust-based high-performance Apache Uniffle shuffle server ☆62 · Feb 10, 2026 · Updated 2 weeks ago
- A small helper library for working with Python file-like objects in Rust. ☆47 · Feb 8, 2026 · Updated 2 weeks ago
- Convert sequences of Rust objects to Arrow tables ☆98 · Updated this week
- Apache DataFusion Comet Spark Accelerator ☆1,148 · Updated this week