datafusion-contrib / ray-sql
Distributed SQL Query Engine in Python using Ray
☆243 Updated 6 months ago
Alternatives and similar repositories for ray-sql:
Users who are interested in ray-sql are comparing it to the libraries listed below.
- Apache DataFusion Ray ☆183 Updated 2 weeks ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆239 Updated last week
- A native Delta implementation for integration with any query engine ☆220 Updated this week
- Ibis Substrait Compiler ☆102 Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆203 Updated this week
- Rust implementation of Apache Iceberg with integration for DataFusion ☆163 Updated this week
- Pure Rust Iceberg Implementation ☆163 Updated 8 months ago
- New file format for storage of large columnar datasets. ☆522 Updated this week
- The native Rust implementation for Apache Hudi, with Python API bindings. ☆207 Updated last week
- A collection of RBIR projects and posts for anyone interested in joining this journey. ☆231 Updated this week
- Apache DataFusion Python Bindings ☆439 Updated this week
- DataFusion TableProviders for reading data from other systems ☆102 Updated this week
- A purely experimental DuckDB Deltalake extension ☆95 Updated this week
- Boring Data Tool ☆217 Updated last year
- Batteries included CLI, TUI, and server implementations for DataFusion. ☆146 Updated this week
- Apache Iceberg C++ ☆63 Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆243 Updated 6 months ago
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆78 Updated 6 months ago
- Database connectivity API standard and libraries for Apache Arrow ☆428 Updated this week
- Apache DataFusion Benchmarks ☆18 Updated last week
- View parquet files online ☆148 Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible. ☆119 Updated this week
- Apache DataFusion Comet Spark Accelerator ☆930 Updated this week
- TPC-H benchmark data generation in pure Rust ☆34 Updated this week
- Towards a New File Format ☆218 Updated last month
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL). ☆17 Updated 2 months ago
- Quickly view your data ☆304 Updated last week
- A User-Defined Function Framework for Apache Arrow. ☆89 Updated this week