datafusion-contrib / ray-sql
Distributed SQL Query Engine in Python using Ray
☆243 Updated 7 months ago
Alternatives and similar repositories for ray-sql:
Users interested in ray-sql are comparing it to the libraries listed below.
- Apache DataFusion Ray ☆188 Updated last month
- Pure Rust Iceberg Implementation ☆163 Updated 8 months ago
- Ibis Substrait Compiler ☆102 Updated this week
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆241 Updated last month
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆207 Updated this week
- Rust implementation of Apache Iceberg with integration for DataFusion ☆175 Updated this week
- New file format for storage of large columnar datasets. ☆538 Updated this week
- A native Delta implementation for integration with any query engine ☆226 Updated this week
- A collection of RBIR projects and posts for anyone interested in joining this journey. ☆238 Updated this week
- The native Rust implementation for Apache Hudi, with Python API bindings. ☆211 Updated last week
- A purely experimental DuckDB Deltalake extension ☆95 Updated this week
- DataFusion TableProviders for reading data from other systems ☆110 Updated last week
- Batteries included CLI, TUI, and server implementations for DataFusion. ☆152 Updated this week
- ☆261 Updated this week
- ☆47 Updated this week
- Apache Paimon Rust: the Rust implementation of Apache Paimon. ☆122 Updated 2 weeks ago
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆79 Updated 7 months ago
- Apache DataFusion Benchmarks ☆18 Updated last month
- Apache DataFusion Python Bindings ☆448 Updated last week
- Boring Data Tool ☆218 Updated last year
- ☆85 Updated this week
- CMU-DB's Cascades optimizer framework ☆397 Updated 4 months ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆249 Updated 7 months ago
- View parquet files online ☆151 Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆328 Updated 2 years ago
- Template for DuckDB extensions to help you develop, test and deploy a custom extension ☆187 Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible. ☆127 Updated last week
- Apache Iceberg C++ ☆63 Updated this week
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL). ☆18 Updated 2 months ago
- Quickly view your data ☆308 Updated last month