dask-contrib / dask-sql
Distributed SQL Engine in Python using Dask
☆409 Updated last year
Alternatives and similar repositories for dask-sql
Users who are interested in dask-sql are comparing it to the libraries listed below.
- Apache DataFusion Python Bindings ☆554 Updated last week
- Ibis Substrait Compiler ☆109 Updated last week
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you… ☆267 Updated last week
- Database connectivity API standard and libraries for Apache Arrow ☆545 Updated this week
- ☆70 Updated last year
- Python binding for DataFusion ☆59 Updated 3 years ago
- ☆374 Updated last week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆279 Updated last year
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆652 Updated this week
- Distributed SQL Query Engine in Python using Ray ☆246 Updated last year
- Turning PySpark Into a Universal DataFrame API ☆481 Updated last week
- A library that provides useful extensions to Apache Spark and PySpark. ☆232 Updated 2 weeks ago
- A purely experimental DuckDB Deltalake extension ☆95 Updated this week
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame. ☆301 Updated last week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆332 Updated 2 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more... ☆145 Updated 3 months ago
- Pandas ExtensionDType/Array backed by Apache Arrow ☆232 Updated 2 years ago
- Native Kubernetes integration for Dask ☆324 Updated 3 weeks ago
- Python bindings for sqlparser-rs ☆201 Updated 8 months ago
- A native Delta implementation for integration with any query engine ☆311 Updated last week
- easy install parquet-tools ☆184 Updated last year
- Apache DataFusion Ray ☆229 Updated 4 months ago
- Generate and Visualize Data Lineage from query history ☆327 Updated 2 years ago
- SQLAlchemy driver for DuckDB ☆482 Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆86 Updated last year
- Joblib Apache Spark Backend ☆249 Updated 10 months ago
- Python client for Trino ☆411 Updated 4 months ago
- A command line tool to query an ODBC data source and write the result into a parquet file. ☆248 Updated this week
- Python implementation of the parquet columnar file format. ☆881 Updated last month
- DuckDB extension for Delta Lake ☆212 Updated this week