dask-contrib / dask-sql
Distributed SQL Engine in Python using Dask
☆405 Updated 9 months ago
Alternatives and similar repositories for dask-sql
Users who are interested in dask-sql are comparing it to the libraries listed below.
- Apache DataFusion Python Bindings ☆460 Updated this week
- Database connectivity API standard and libraries for Apache Arrow ☆458 Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆222 Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆255 Updated 8 months ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆637 Updated this week
- Ibis Substrait Compiler ☆103 Updated this week
- Distributed SQL Query Engine in Python using Ray ☆243 Updated 8 months ago
- Turning PySpark Into a Universal DataFrame API ☆407 Updated last week
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame. ☆301 Updated last year
- ☆284 Updated this week
- Native Kubernetes integration for Dask ☆323 Updated this week
- ☆70 Updated 5 months ago
- Python client for Trino ☆376 Updated this week
- Pandas ExtensionDType/Array backed by Apache Arrow ☆230 Updated 2 years ago
- A library that provides useful extensions to Apache Spark and PySpark. ☆225 Updated 3 months ago
- Python bindings for sqlparser-rs ☆191 Updated last month
- A native Delta implementation for integration with any query engine ☆234 Updated this week
- ☆293 Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆330 Updated 2 years ago
- Python binding for DataFusion ☆59 Updated 2 years ago
- Apache PyIceberg ☆782 Updated this week
- DuckDB extension for Delta Lake ☆192 Updated this week
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated last year
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source. ☆344 Updated last year
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆407 Updated last month
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings. ☆225 Updated this week
- A purely experimental DuckDB Deltalake extension ☆95 Updated this week
- Coming soon ☆61 Updated last year
- Boring Data Tool ☆223 Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more... ☆143 Updated this week