Library for bringing distributed capabilities to Apache DataFusion
☆72Mar 18, 2026Updated this week
Alternatives and similar repositories for datafusion-distributed
Users who are interested in datafusion-distributed are comparing it to the libraries listed below
- An experimental (work-in-progress) statically typed implementation of Apache Arrow☆28Feb 16, 2026Updated last month
- Python Package for ducklake☆20Jun 5, 2025Updated 9 months ago
- Data Engineering framework written in Python based on Polars.☆14May 1, 2024Updated last year
- Apache DataFusion Ray☆228Oct 5, 2025Updated 5 months ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible.☆173Mar 4, 2026Updated 2 weeks ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆270Updated this week
- A fine push-down parquet scanner in Rust.☆37Updated this week
- Fast approximate joins on string columns for polars dataframes.☆16Dec 24, 2025Updated 2 months ago
- A collection of resources about DataFusion☆17Nov 11, 2024Updated last year
- Scalable datastore for metrics, events, and real-time analytics - InfluxDB3-Core fork Unlocked w/ Enterprise Features☆29Updated this week
- Tantivy directory implementation backed by object_store☆40Jan 22, 2024Updated 2 years ago
- Apache Arrow database client for many databases.☆49Updated this week
- Tracks the Rerun open source work☆11Oct 3, 2022Updated 3 years ago
- Apache Paimon Rust: the Rust implementation of Apache Paimon.☆150Updated this week
- Building block library for using Apache Arrow in Rust WebAssembly modules.☆28Feb 19, 2026Updated last month
- Apache DataFusion Benchmarks☆22Mar 3, 2026Updated 2 weeks ago
- Simple bloom filter☆12Feb 9, 2022Updated 4 years ago
- Rust DataFusion Server☆25Updated this week
- A simplified Try-Confirm/Cancel (TCC) pattern implementation with Flight/RentalCar reservation business workflow.☆11Apr 3, 2019Updated 6 years ago
- Alternative admin UI for CrateDB databases☆12Oct 27, 2025Updated 4 months ago
- Provides time series data and metadata as Apache Arrow.☆16Updated this week
- Incremental view maintenance & query rewriting for materialized views in DataFusion☆70Mar 9, 2026Updated last week
- Embeddable Aggregate Management System for Streams and Queries.☆109Feb 25, 2026Updated 3 weeks ago
- Rust object_store crate☆229Updated this week
- React hook to run pyodide in a web worker☆12Jan 29, 2025Updated last year
- InfluxData's core functionality for InfluxDB Edge and IOx☆51Jan 28, 2026Updated last month
- Framework to build data pipelines declaratively☆95Dec 6, 2025Updated 3 months ago
- Results cache for Apache DataFusion☆33Oct 29, 2024Updated last year
- Pushdown cache for DataFusion☆390Mar 14, 2026Updated last week
- An easy-to-use structured prompt builder for LLMs in TypeScript.☆17Nov 8, 2024Updated last year
- Use Claude Code & other AI agents from inside DuckDB via extension☆50Dec 11, 2025Updated 3 months ago
- (Experimental) Template for Rust-based DuckDB extensions☆100Mar 12, 2026Updated last week
- Apache DataFusion Comet Spark Accelerator☆1,154Updated this week
- ☆15Jan 9, 2025Updated last year
- A conda-smithy repository for polars.☆12Updated this week
- Apache Iceberg☆1,243Updated this week
- ☆17Sep 20, 2021Updated 4 years ago
- Benchmarking library for stable Rust☆49Dec 21, 2025Updated 3 months ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆1,221Updated this week