dask-contrib / dask-sql
Distributed SQL Engine in Python using Dask
☆397 · Updated 2 months ago
Related projects
Alternatives and complementary repositories for dask-sql
- Apache DataFusion Python Bindings ☆375 · Updated this week
- Database connectivity API standard and libraries for Apache Arrow ☆384 · Updated this week
- Ibis Substrait Compiler ☆95 · Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆205 · Updated last month
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆623 · Updated last week
- Native Kubernetes integration for Dask ☆312 · Updated 2 weeks ago
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t… ☆165 · Updated 2 weeks ago
- Apache PyIceberg ☆473 · Updated this week
- Pandas ExtensionDType/Array backed by Apache Arrow ☆229 · Updated last year
- Turning PySpark Into a Universal DataFrame API ☆323 · Updated this week
- ☆159 · Updated last month
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆306 · Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more... ☆134 · Updated last month
- SQLAlchemy driver for DuckDB ☆355 · Updated this week
- Cylon is a fast, scalable, distributed-memory, parallel runtime with a Pandas-like DataFrame. ☆298 · Updated 5 months ago
- A native Delta implementation for integration with any query engine ☆144 · Updated this week
- Docker images for dask ☆232 · Updated last week
- Distributed SQL Query Engine in Python using Ray ☆239 · Updated last month
- A consistent table management library in Python ☆161 · Updated last year
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,013 · Updated last month
- ☆242 · Updated 2 months ago
- A purely experimental DuckDB Deltalake extension ☆94 · Updated 2 weeks ago
- A command line tool to query an ODBC data source and write the result into a parquet file. ☆223 · Updated this week
- Python binding for DataFusion ☆59 · Updated 2 years ago
- A data modelling layer built on top of polars and pydantic ☆197 · Updated last year
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries. ☆315 · Updated 3 months ago
- DuckDB extension for Delta Lake ☆136 · Updated last week
- JupyterLab extension for Dask ☆312 · Updated last year
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,205 · Updated this week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension ☆148 · Updated last week