delta-incubator / dat
Delta Acceptance Testing
☆23 Updated 5 months ago
Alternatives and similar repositories for dat
Users who are interested in dat are comparing it to the libraries listed below.
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou… ☆10 Updated 2 years ago
- Apache DataFusion Python Bindings ☆554 Updated this week
- Turning PySpark Into a Universal DataFrame API ☆481 Updated this week
- Dask integration for Snowflake ☆30 Updated 5 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 Updated 2 years ago
- ☆59 Updated 2 years ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆280 Updated last year
- Boring Data Tool ☆239 Updated last year
- ✨ A Pydantic to PySpark schema library ☆118 Updated last week
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol. ☆21 Updated last year
- Proof-of-concept extension combining the delta extension with Unity Catalog ☆96 Updated this week
- ☆70 Updated last year
- ☆176 Updated 8 months ago
- A library that provides useful extensions to Apache Spark and PySpark. ☆232 Updated last week
- A Python Library to support running data quality rules while the spark job is running⚡ ☆194 Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data. ☆261 Updated 2 years ago
- ☆374 Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆224 Updated last month
- Ibis Substrait Compiler ☆109 Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆86 Updated last year
- Fake Pandas / PySpark DataFrame creator ☆48 Updated last year
- Write your dbt models using Ibis ☆74 Updated 10 months ago
- A purely experimental DuckDB Deltalake extension ☆95 Updated this week
- Database connectivity API standard and libraries for Apache Arrow ☆539 Updated this week
- Run, mock and test fake Snowflake databases locally. ☆164 Updated 3 weeks ago
- ☆30 Updated last year
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆95 Updated 11 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆234 Updated 3 months ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL ☆46 Updated last month
- Work with your web service, database, and streaming schemas in a single format. ☆350 Updated last month