apache / datafusion-ray
Apache DataFusion Ray
☆116Updated this week
Related projects ⓘ
Alternatives and complementary repositories for datafusion-ray
- A native Delta implementation for integration with any query engine☆144Updated this week
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆174Updated this week
- Pure Rust Iceberg Implementation☆166Updated 3 months ago
- Rust implementation of Apache Iceberg with integration for Datafusion☆107Updated this week
- A native Rust library for Apache Hudi, with bindings into Python☆146Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆77Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆74Updated last month
- Distributed SQL Query Engine in Python using Ray☆239Updated last month
- DataFusion TableProviders for reading data from other systems☆61Updated this week
- A purely experimental DuckDB Deltalake extension☆94Updated 2 weeks ago
- Lakekeeper: A Rust native Iceberg REST Catalog☆234Updated this week
- An opinionated and batteries included DataFusion implementation.☆114Updated this week
- Apache Paimon Rust The rust implementation of Apache Paimon.☆100Updated last month
- ☆31Updated this week
- Boring Data Tool☆209Updated 7 months ago
- Apache Spark Connect Client for Rust☆90Updated 2 weeks ago
- Apache Iceberg☆658Updated this week
- ☆159Updated last month
- A User-Defined Function Framework for Apache Arrow.☆77Updated this week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆148Updated last week
- Apache DataFusion Python Bindings☆375Updated this week
- ☆33Updated last year
- Experimental support for serializing DataFusion plans using substrait☆44Updated last year
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆228Updated 6 months ago
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆16Updated last month
- Fill Apache Arrow record batches from an ODBC data source in Rust.☆51Updated 2 weeks ago
- DuckDB extension for Delta Lake☆136Updated last week
- Embeddable Aggregate Management System for Streams and Queries.☆83Updated 2 weeks ago
- Pythonic Iceberg REST Catalog☆67Updated 2 months ago