Distributed SQL Engine in Python using Dask
☆410Aug 29, 2024Updated last year
Alternatives and similar repositories for dask-sql
Users that are interested in dask-sql are comparing it to the libraries listed below
Sorting:
- Apache DataFusion Python Bindings☆564Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,977Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,138Updated this week
- GlareDB: A light and fast SQL database for analytics☆1,001Nov 14, 2025Updated 3 months ago
- Query and transform data with PRQL☆137Sep 23, 2023Updated 2 years ago
- Distributed SQL Query Engine in Python using Ray☆245Oct 2, 2024Updated last year
- Ibis Substrait Compiler☆109Feb 20, 2026Updated last week
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.☆2,007Sep 16, 2022Updated 3 years ago
- Apache DataFusion SQL Query Engine☆8,428Updated this week
- Native Kubernetes integration for Dask☆324Jan 13, 2026Updated last month
- Analytical database for data-driven Web applications 🪶☆509Feb 25, 2025Updated last year
- Batteries included CLI, TUI, and server implementations for DataFusion.☆189Feb 16, 2026Updated last week
- Database connectivity API standard and libraries for Apache Arrow☆556Updated this week
- the portable Python dataframe library☆6,404Feb 21, 2026Updated last week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,476Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆280Sep 25, 2024Updated last year
- A Delta Lake reader for Dask☆53Jul 29, 2025Updated 6 months ago
- Real-time stream processing for python☆1,293Updated this week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆653Feb 4, 2026Updated 3 weeks ago
- Rust DataFusion Server☆25Feb 4, 2026Updated 3 weeks ago
- Boring Data Tool☆241Mar 21, 2024Updated last year
- Fastest library to load data from DB to DataFrames in Rust and Python☆2,562Feb 2, 2026Updated 3 weeks ago
- Apache Arrow Flight SQL adapter for PostgreSQL☆101Dec 16, 2025Updated 2 months ago
- A purely experimental DuckDB Deltalake extension☆95Feb 18, 2026Updated last week
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,074Updated this week
- Coming soon☆62Nov 9, 2023Updated 2 years ago
- A native Rust library for Delta Lake, with bindings into Python☆3,156Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆333Mar 28, 2023Updated 2 years ago
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,715Updated this week
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale☆5,250Updated this week
- Apache DataFusion Comet Spark Accelerator☆1,142Feb 20, 2026Updated last week
- Parallel computing with task scheduling☆13,746Updated this week
- Databend SQLAlchemy☆15Sep 1, 2025Updated 5 months ago
- Python bindings for sqlparser-rs☆202May 17, 2025Updated 9 months ago
- Flock: A Low-Cost Streaming Query Engine on FaaS Platforms☆278Dec 29, 2023Updated 2 years ago
- Apache Iceberg☆1,229Updated this week
- New file format for storage of large columnar datasets.☆693Updated this week
- Making data lake work for time series☆1,189Aug 21, 2024Updated last year
- PRQL as a DuckDB extension☆318Sep 22, 2025Updated 5 months ago