datafusion-contrib / ray-sql
Distributed SQL Query Engine in Python using Ray
☆244Updated 5 months ago
Alternatives and similar repositories for ray-sql:
Users that are interested in ray-sql are comparing it to the libraries listed below
- Apache DataFusion Ray☆169Updated last week
- Pure Rust Iceberg Implementation☆163Updated 7 months ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆236Updated 10 months ago
- Rust implementation of Apache Iceberg with integration for Datafusion☆152Updated last week
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆229Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆195Updated this week
- A native Delta implementation for integration with any query engine☆206Updated this week
- Ibis Substrait Compiler☆100Updated this week
- The native Rust implementation for Apache Hudi, with Python API bindings.☆200Updated this week
- New file format for storage of large columnar datasets.☆492Updated this week
- DataFusion TableProviders for reading data from other systems☆92Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆106Updated last week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆78Updated 5 months ago
- Apache DataFusion Python Bindings☆423Updated this week
- Apache Paimon Rust The rust implementation of Apache Paimon.☆110Updated 5 months ago
- Boring Data Tool☆214Updated 11 months ago
- View parquet files online☆137Updated this week
- CMU-DB's Cascades optimizer framework☆396Updated 2 months ago
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆17Updated last month
- Apache Iceberg C++☆49Updated 3 weeks ago
- A User-Defined Function Framework for Apache Arrow.☆88Updated this week
- Apache DataFusion Comet Spark Accelerator☆918Updated this week
- ☆82Updated this week
- Batteries included CLI, TUI, and server implementations for DataFusion.☆141Updated this week
- Quickly view your data☆300Updated last week
- Apache Iceberg☆841Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆121Updated last week
- ☆231Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆419Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆323Updated last year