dask-contrib / dask-sql
Distributed SQL Engine in Python using Dask
☆407 Updated 11 months ago
Alternatives and similar repositories for dask-sql
Users who are interested in dask-sql are comparing it to the libraries listed below.
- Apache DataFusion Python Bindings ☆490 Updated this week
- Database connectivity API standard and libraries for Apache Arrow ☆477 Updated this week
- Ibis Substrait Compiler ☆104 Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆234 Updated 3 weeks ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆267 Updated 10 months ago
- ☆70 Updated 7 months ago
- ☆302 Updated this week
- Python binding for DataFusion ☆59 Updated 3 years ago
- Turning PySpark Into a Universal DataFrame API ☆422 Updated this week
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame. ☆302 Updated last year
- Distributed SQL Query Engine in Python using Ray ☆244 Updated 10 months ago
- A library that provides useful extensions to Apache Spark and PySpark. ☆229 Updated last month
- SQLAlchemy driver for DuckDB ☆447 Updated this week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆642 Updated this week
- Python bindings for sqlparser-rs ☆193 Updated 3 months ago
- A purely experimental DuckDB Deltalake extension ☆95 Updated last week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆330 Updated 2 years ago
- Apache DataFusion Ray ☆217 Updated 2 weeks ago
- easy install parquet-tools ☆182 Updated last year
- Pandas ExtensionDType/Array backed by Apache Arrow ☆231 Updated 2 years ago
- Python client for Trino ☆389 Updated last week
- A native Delta implementation for integration with any query engine ☆246 Updated last week
- Boring Data Tool ☆226 Updated last year
- Generate and Visualize Data Lineage from query history ☆326 Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated last year
- Coming soon ☆61 Updated last year
- A command line tool to query an ODBC data source and write the result into a parquet file. ☆240 Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆81 Updated 10 months ago
- SQLAlchemy for Dremio via the ODBC and Flight interface. ☆30 Updated 2 months ago
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more! ☆593 Updated this week