delta-incubator / dat
Delta Acceptance Testing
☆20 Updated last year
Alternatives and similar repositories for dat
Users who are interested in dat compare it to the libraries listed below.
- ✨ A Pydantic to PySpark schema library ☆99 Updated this week
- Turning PySpark Into a Universal DataFrame API ☆417 Updated this week
- Dask integration for Snowflake ☆30 Updated 2 weeks ago
- Apache DataFusion Python Bindings ☆482 Updated last week
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou… ☆10 Updated 2 years ago
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆81 Updated 10 months ago
- ☆298 Updated last week
- ☆30 Updated 8 months ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆254 Updated last year
- Write your dbt models using Ibis ☆69 Updated 4 months ago
- Proof-of-concept extension combining the delta extension with Unity Catalog ☆89 Updated 3 weeks ago
- Boring Data Tool ☆226 Updated last year
- A Python Library to support running data quality rules while the spark job is running⚡ ☆189 Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆218 Updated last week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends. ☆264 Updated 10 months ago
- ☆70 Updated 7 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆201 Updated 2 weeks ago
- A library that provides useful extensions to Apache Spark and PySpark. ☆228 Updated 2 weeks ago
- Pythonic Iceberg REST Catalog ☆3 Updated last month
- A purely experimental DuckDB Deltalake extension ☆95 Updated this week
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆86 Updated 5 months ago
- Database connectivity API standard and libraries for Apache Arrow ☆476 Updated this week
- ☆150 Updated 2 months ago
- DuckDB extension for Delta Lake ☆194 Updated last week
- Work with your web service, database, and streaming schemas in a single format. ☆345 Updated last month
- Distributed SQL Engine in Python using Dask ☆407 Updated 11 months ago
- Fake Pandas / PySpark DataFrame creator ☆47 Updated last year
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol. ☆21 Updated last year
- Coming soon ☆61 Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications ☆46 Updated last year