dask-contrib / dask-sql
Distributed SQL Engine in Python using Dask
☆407 Updated 11 months ago
Alternatives and similar repositories for dask-sql
Users who are interested in dask-sql are comparing it to the libraries listed below.
- Apache DataFusion Python Bindings ☆480 Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆233 Updated this week
- Ibis Substrait Compiler ☆104 Updated 2 weeks ago
- Database connectivity API standard and libraries for Apache Arrow ☆476 Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆264 Updated 10 months ago
- Python binding for DataFusion ☆59 Updated 3 years ago
- ☆70 Updated 6 months ago
- ☆297 Updated last week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆639 Updated this week
- Distributed SQL Query Engine in Python using Ray ☆244 Updated 10 months ago
- Turning PySpark Into a Universal DataFrame API ☆416 Updated this week
- A library that provides useful extensions to Apache Spark and PySpark. ☆228 Updated last week
- A purely experimental DuckDB Deltalake extension ☆95 Updated last week
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame. ☆302 Updated last year
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆330 Updated 2 years ago
- Python bindings for sqlparser-rs ☆191 Updated 2 months ago
- A native Delta implementation for integration with any query engine ☆238 Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆81 Updated 10 months ago
- easy install parquet-tools ☆180 Updated last year
- Python client for Trino ☆385 Updated last month
- A command line tool to query an ODBC data source and write the result into a parquet file. ☆239 Updated this week
- SQLAlchemy driver for DuckDB ☆441 Updated this week
- Pandas ExtensionDType/Array backed by Apache Arrow ☆231 Updated 2 years ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆410 Updated 2 months ago
- Apache DataFusion Ray ☆214 Updated 3 months ago
- ☆303 Updated 2 weeks ago
- DuckDB extension for Delta Lake ☆194 Updated last week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,362 Updated last month
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source. ☆344 Updated last year
- Coming soon ☆61 Updated last year