datafusion-contrib / ray-sql
Distributed SQL Query Engine in Python using Ray
☆243Updated 4 months ago
Alternatives and similar repositories for ray-sql:
Users who are interested in ray-sql are comparing it to the libraries listed below.
- Apache DataFusion Ray☆157Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆189Updated this week
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆235Updated 9 months ago
- Ibis Substrait Compiler☆98Updated this week
- Pure Rust Iceberg Implementation☆164Updated 6 months ago
- New file format for storage of large columnar datasets.☆480Updated 2 weeks ago
- Rust implementation of Apache Iceberg with integration for DataFusion☆140Updated this week
- A native Delta implementation for integration with any query engine☆186Updated last week
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆220Updated this week
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- Apache DataFusion Python Bindings☆411Updated last week
- Apache Arrow Flight SQL adapter for PostgreSQL☆75Updated last month
- ☆209Updated this week
- ☆41Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆76Updated 4 months ago
- Apache Paimon Rust: the Rust implementation of Apache Paimon.☆109Updated 4 months ago
- Database connectivity API standard and libraries for Apache Arrow☆406Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆231Updated 4 months ago
- An opinionated and batteries included DataFusion implementation.☆131Updated last week
- The native Rust implementation for Apache Hudi, with Python API bindings.☆193Updated this week
- ☆105Updated last year
- ☆80Updated this week
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆16Updated this week
- Query Plan Markup Language☆45Updated last year
- Experimental support for serializing DataFusion plans using Substrait☆45Updated 2 years ago
- DataFusion TableProviders for reading data from other systems☆79Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible.☆94Updated this week
- Embeddable Aggregate Management System for Streams and Queries.☆90Updated last month
- ☆33Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆318Updated last year