delta-incubator / dat
Delta Acceptance Testing
☆21 Updated 4 months ago
Alternatives and similar repositories for dat
Users that are interested in dat are comparing it to the libraries listed below
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou… ☆10 Updated 2 years ago
- ✨ A Pydantic to PySpark schema library ☆115 Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆85 Updated last year
- Dask integration for Snowflake ☆30 Updated 4 months ago
- Apache DataFusion Python Bindings ☆543 Updated this week
- ☆70 Updated 11 months ago
- A library that provides useful extensions to Apache Spark and PySpark. ☆232 Updated 2 weeks ago
- ☆355 Updated 2 weeks ago
- A Python Library to support running data quality rules while the spark job is running⚡ ☆193 Updated this week
- Distributed SQL Engine in Python using Dask ☆409 Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆222 Updated 3 weeks ago
- Turning PySpark Into a Universal DataFrame API ☆469 Updated 2 weeks ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL ☆46 Updated 2 weeks ago
- Ibis Substrait Compiler ☆108 Updated last week
- Write your dbt models using Ibis ☆74 Updated 9 months ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆278 Updated last year
- ☆167 Updated 7 months ago
- Database connectivity API standard and libraries for Apache Arrow ☆526 Updated this week
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary! ☆235 Updated 11 months ago
- Apache DataFusion Ray ☆227 Updated 2 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 Updated last year
- A purely experimental DuckDB Deltalake extension ☆95 Updated last week
- ☆59 Updated last year
- Boring Data Tool ☆239 Updated last year
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol. ☆21 Updated last year
- Fake Pandas / PySpark DataFrame creator ☆48 Updated last year
- Proof-of-concept extension combining the delta extension with Unity Catalog ☆95 Updated last week
- Schema modelling framework for decentralised domain-driven ownership of data. ☆260 Updated 2 years ago
- Arrow, pydantic style ☆85 Updated 3 years ago
- Work with your web service, database, and streaming schemas in a single format. ☆348 Updated 3 months ago