dask-contrib / dask-sql
Distributed SQL Engine in Python using Dask
☆401Updated 6 months ago
Alternatives and similar repositories for dask-sql:
Users that are interested in dask-sql are comparing it to the libraries listed below
- Apache DataFusion Python Bindings☆423Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆419Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆195Updated this week
- Ibis Substrait Compiler☆100Updated this week
- Turning PySpark Into a Universal DataFrame API☆375Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆235Updated 5 months ago
- Distributed SQL Query Engine in Python using Ray☆244Updated 5 months ago
- ☆231Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆323Updated last year
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆628Updated 2 weeks ago
- Apache PyIceberg☆640Updated this week
- A library that provides useful extensions to Apache Spark and PySpark.☆219Updated last week
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated 2 years ago
- A native Delta implementation for integration with any query engine☆206Updated this week
- Apache DataFusion Ray☆169Updated last week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆179Updated last week
- The native Rust implementation for Apache Hudi, with Python API bindings.☆200Updated this week
- Generate and Visualize Data Lineage from query history☆322Updated last year
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,274Updated this week
- Python binding for DataFusion☆59Updated 2 years ago
- A purely experimental DuckDB Deltalake extension☆95Updated last week
- ☆68Updated 2 months ago
- Python bindings for sqlparser-rs☆175Updated last month
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆299Updated 9 months ago
- SQLAlchemy driver for DuckDB☆397Updated this week
- Boring Data Tool☆214Updated last year
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆230Updated 3 months ago
- Making DAG construction easier☆258Updated 2 weeks ago
- Apache DataFusion Comet Spark Accelerator☆918Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆393Updated last week