delta-io / delta-sharing
An open protocol for secure data sharing
☆915 Updated this week
Alternatives and similar repositories for delta-sharing
Users who are interested in delta-sharing are comparing it to the libraries listed below.
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,395 Updated this week
- An Open Standard for lineage metadata collection ☆2,255 Updated this week
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg ☆1,799 Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆426 Updated 8 months ago
- PyIceberg ☆978 Updated this week
- This repository has moved into https://github.com/dbt-labs/dbt-adapters ☆444 Updated 5 months ago
- Collect, aggregate, and visualize a data ecosystem's metadata ☆2,093 Updated last week
- Egeria core ☆888 Updated this week
- New Generation Opensource Data Stack Demo ☆454 Updated 2 years ago
- ☆269 Updated last year
- Open Control Plane for Tables in Data Lakehouse ☆377 Updated this week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆253 Updated 3 weeks ago
- Dremio - the missing link in modern data ☆1,461 Updated 3 months ago
- Python API for Deequ ☆809 Updated 9 months ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆279 Updated 3 months ago
- Data Lineage Tracking And Visualization Solution ☆652 Updated this week
- Python client for Trino ☆408 Updated 4 months ago
- Delta Lake helper methods in PySpark ☆326 Updated last year
- Home of the Open Data Contract Standard (ODCS). ☆636 Updated this week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source. ☆346 Updated last year
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. ☆377 Updated 7 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆222 Updated last month
- Generate and Visualize Data Lineage from query history ☆327 Updated 2 years ago
- PySpark test helper methods with beautiful error messages ☆746 Updated last week
- Apache DataFusion Comet Spark Accelerator ☆1,102 Updated this week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆1,141 Updated this week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ☆2,271 Updated this week
- Delta Lake examples ☆236 Updated last year
- Template for a data contract used in a data mesh. ☆486 Updated last year
- Official Dockerfile for Apache Spark ☆162 Updated this week