datafusion-contrib / ray-sqlLinks
Distributed SQL Query Engine in Python using Ray
☆244Updated 10 months ago
Alternatives and similar repositories for ray-sql
Users that are interested in ray-sql are comparing it to the libraries listed below
Sorting:
- Apache DataFusion Ray☆217Updated 2 weeks ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆251Updated 4 months ago
- Pure Rust Iceberg Implementation☆162Updated last year
- Ibis Substrait Compiler☆104Updated last week
- TPC-H benchmark data generation in pure Rust☆131Updated this week
- New file format for storage of large columnar datasets.☆586Updated 2 weeks ago
- A native Delta implementation for integration with any query engine☆246Updated last week
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆278Updated this week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆214Updated last week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆234Updated 3 weeks ago
- Distributed pushdown cache for DataFusion☆239Updated last week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆247Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆81Updated 10 months ago
- Comptaction runtime for Apache Iceberg.☆67Updated this week
- Apache DataFusion Benchmarks☆20Updated 4 months ago
- DataFusion TableProviders for reading data from other systems☆136Updated this week
- A purely experimental DuckDB Deltalake extension☆95Updated last week
- ☆47Updated last month
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆141Updated 2 weeks ago
- Data lake indices☆40Updated last month
- Boring Data Tool☆226Updated last year
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆42Updated 11 months ago
- View parquet files online☆172Updated 3 weeks ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆163Updated last month
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆62Updated last year
- Experimental support for serializing DataFusion plans using substrait☆45Updated 2 years ago
- Apache Iceberg C++☆106Updated last week
- Apache DataFusion Python Bindings☆490Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆136Updated 3 weeks ago
- Apache DataFusion Comet Spark Accelerator☆1,022Updated last week