apache / datafusion-rayLinks
Apache DataFusion Ray
☆194Updated 2 months ago
Alternatives and similar repositories for datafusion-ray
Users that are interested in datafusion-ray are comparing it to the libraries listed below
Sorting:
- Rust implementation of Apache Iceberg with integration for Datafusion☆191Updated this week
- DataFusion TableProviders for reading data from other systems☆122Updated this week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆222Updated this week
- A native Delta implementation for integration with any query engine☆233Updated this week
- Pure Rust Iceberg Implementation☆163Updated 9 months ago
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆251Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆132Updated last month
- Open, Multi-modal Catalog for Data & AI, written in Rust☆79Updated 8 months ago
- Distributed SQL Query Engine in Python using Ray☆243Updated 8 months ago
- Apache Spark Connect Client for Rust☆108Updated last month
- TPC-H benchmark data generation in pure Rust☆85Updated last month
- Batteries included CLI, TUI, and server implementations for DataFusion.☆154Updated 3 weeks ago
- Apache Paimon Rust The rust implementation of Apache Paimon.☆126Updated last month
- Boring Data Tool☆222Updated last year
- ☆278Updated last week
- ☆49Updated 3 weeks ago
- Database connectivity API standard and libraries for Apache Arrow☆452Updated this week
- Distributed pushdown cache for DataFusion☆172Updated last week
- A purely experimental DuckDB Deltalake extension☆95Updated last week
- Apache DataFusion Python Bindings☆457Updated last week
- View parquet files online☆158Updated this week
- The Control Plane for Apache Iceberg☆59Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆253Updated 8 months ago
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆19Updated 3 months ago
- JSON support for DataFusion (unofficial)☆42Updated last month
- New file format for storage of large columnar datasets.☆556Updated last week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆330Updated 2 years ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆218Updated this week
- A User-Defined Function Framework for Apache Arrow.☆92Updated 3 weeks ago
- Apache DataFusion Benchmarks☆19Updated 2 months ago