apache / datafusion-ray
Apache DataFusion Ray
☆191Updated last month
Alternatives and similar repositories for datafusion-ray
Users that are interested in datafusion-ray are comparing it to the libraries listed below
Sorting:
- Rust implementation of Apache Iceberg with integration for Datafusion☆177Updated last week
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆241Updated this week
- Pure Rust Iceberg Implementation☆163Updated 9 months ago
- DataFusion TableProviders for reading data from other systems☆116Updated this week
- A native Delta implementation for integration with any query engine☆225Updated last week
- The native Rust implementation for Apache Hudi, with Python API bindings.☆217Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆127Updated 3 weeks ago
- Distributed SQL Query Engine in Python using Ray☆244Updated 7 months ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆153Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆79Updated 7 months ago
- Apache DataFusion Python Bindings☆451Updated this week
- TPC-H benchmark data generation in pure Rust☆67Updated 2 weeks ago
- Apache Paimon Rust The rust implementation of Apache Paimon.☆122Updated 3 weeks ago
- Apache Spark Connect Client for Rust☆107Updated 2 weeks ago
- The Control Plane for Apache Iceberg☆43Updated this week
- ☆267Updated this week
- Boring Data Tool☆220Updated last year
- Message queue and data streaming based on cloud native services.☆110Updated 3 weeks ago
- ☆48Updated this week
- View parquet files online☆152Updated last week
- Apache Iceberg☆923Updated this week
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆241Updated last month
- Database connectivity API standard and libraries for Apache Arrow☆437Updated this week
- A User-Defined Function Framework for Apache Arrow.☆89Updated last week
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆39Updated 7 months ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆250Updated 7 months ago
- New file format for storage of large columnar datasets.☆541Updated last week
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆18Updated 3 months ago
- Apache DataFusion Benchmarks☆18Updated last month
- ☆33Updated this week