dask-contrib / dask-sql
Distributed SQL Engine in Python using Dask
☆405Updated 9 months ago
Alternatives and similar repositories for dask-sql
Users who are interested in dask-sql are comparing it to the libraries listed below.
- Apache DataFusion Python Bindings☆455Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆216Updated last week
- Database connectivity API standard and libraries for Apache Arrow☆450Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆250Updated 8 months ago
- Distributed SQL Query Engine in Python using Ray☆243Updated 8 months ago
- Ibis Substrait Compiler☆102Updated this week
- Turning PySpark Into a Universal DataFrame API☆403Updated this week
- ☆278Updated last week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆636Updated last month
- Pandas ExtensionDType/Array backed by Apache Arrow☆230Updated 2 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated 11 months ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆330Updated 2 years ago
- Native Kubernetes integration for Dask☆322Updated last month
- Making DAG construction easier☆265Updated 2 weeks ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,215Updated last week
- A library that provides useful extensions to Apache Spark and PySpark.☆224Updated 2 months ago
- Docker images for dask☆241Updated last week
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆141Updated last week
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆337Updated last month
- A purely experimental DuckDB Deltalake extension☆95Updated last week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,324Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,082Updated 2 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- deferred, multi-engine computational framework☆277Updated this week
- Python binding for DataFusion☆59Updated 2 years ago
- Python bindings for sqlparser-rs☆186Updated 2 weeks ago
- Python implementation of the Parquet columnar file format.☆833Updated 2 months ago
- Apache DataFusion Ray☆194Updated 2 months ago
- A native Delta implementation for integration with any query engine☆233Updated this week
- Apache PyIceberg☆750Updated this week