dask-contrib / dask-deltatableLinks
A Delta Lake reader for Dask
☆53Updated last week
Alternatives and similar repositories for dask-deltatable
Users that are interested in dask-deltatable are comparing it to the libraries listed below
Sorting:
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- Arrow, pydantic style☆84Updated 2 years ago
- Dask integration for Snowflake☆30Updated 7 months ago
- Coming soon☆61Updated last year
- A Minimalistic Rust Implementation of Delta Sharing Server.☆92Updated 3 months ago
- ☆38Updated this week
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆88Updated last month
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆24Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆143Updated this week
- Python binding for DataFusion☆59Updated 2 years ago
- DB API 2 interface for Flight SQL with SQLAlchemy extras.☆39Updated 3 months ago
- ☆33Updated last year
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆33Updated 2 years ago
- ☆58Updated last year
- ☆70Updated 6 months ago
- Ibis Substrait Compiler☆103Updated this week
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆83Updated 4 months ago
- Write your dbt models using Ibis☆68Updated 3 months ago
- A leightweight UI for Lakekeeper☆13Updated this week
- Distributed SQL Engine in Python using Dask☆406Updated 10 months ago
- ☆52Updated this week
- Apache Arrow Ballista Python bindings☆37Updated last year
- An experimental Athena extension for DuckDB 🐤☆54Updated 6 months ago
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆13Updated 4 months ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 11 months ago
- Proof-of-concept extension combining the delta extension with Unity Catalog☆89Updated 3 weeks ago
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆230Updated this week
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆96Updated this week
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆54Updated last week