Distributed SQL Query Engine in Python using Ray
☆245Oct 2, 2024Updated last year
Alternatives and similar repositories for ray-sql
Users that are interested in ray-sql are comparing it to the libraries listed below.
- Experimental DataFusion Optimizer☆52Jun 9, 2023Updated 2 years ago
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆236Updated this week
- Apache DataFusion Ray☆230Oct 5, 2025Updated 6 months ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆194Apr 14, 2026Updated 2 weeks ago
- Apache DataFusion Comet Spark Accelerator☆1,174Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆2,021Updated this week
- ☆33May 9, 2025Updated 11 months ago
- SQL Benchmark derived from TPC-H☆11May 20, 2023Updated 2 years ago
- Apache Iceberg☆1,270Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,501Updated this week
- Query Plan Markup Language☆45Jan 18, 2024Updated 2 years ago
- A lightweight UI for Lakekeeper☆16Updated this week
- Apache DataFusion Python Bindings☆580Updated this week
- Pure Rust Iceberg Implementation☆163Aug 13, 2024Updated last year
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,758Updated this week
- Apache DataFusion SQL Query Engine☆8,639Apr 24, 2026Updated last week
- SQLBench Runners☆13Dec 17, 2023Updated 2 years ago
- GlareDB: A light and fast SQL database for analytics☆1,010Nov 14, 2025Updated 5 months ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆371Apr 10, 2026Updated 2 weeks ago
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow☆384Jul 31, 2024Updated last year
- Distributed SQL Engine in Python using Dask☆411Aug 29, 2024Updated last year
- CMU-DB's Cascades optimizer framework☆405Jan 6, 2025Updated last year
- Making data lake work for time series☆1,191Aug 21, 2024Updated last year
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆322Updated this week
- Rust lib to read from Apache ORC☆18Jun 9, 2023Updated 2 years ago
- Exoshuffle-CloudSort☆29Mar 2, 2023Updated 3 years ago
- EpochFS is a versioned cloud file system with git-like branching, transaction support.☆17Apr 23, 2026Updated last week
- Cache server :)☆32Sep 5, 2023Updated 2 years ago
- A composable and fully extensible C++ execution engine library for data management systems.☆4,112Updated this week
- Analytical database for data-driven Web applications 🪶☆514Feb 25, 2025Updated last year
- Boring Data Tool☆242Mar 21, 2024Updated 2 years ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,363Updated this week
- Quickly view your data☆354Apr 22, 2026Updated last week
- Tools for generating TPC-* datasets☆31Jun 23, 2024Updated last year
- A native storage format for Apache Arrow☆83Oct 18, 2023Updated 2 years ago
- A reader that buffers ranged calls☆12May 17, 2022Updated 3 years ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,552Updated this week
- A native Rust library for Delta Lake, with bindings into Python☆3,204Updated this week
- ☆15Mar 26, 2026Updated last month