delta-io / delta-sharing
An open protocol for secure data sharing
☆887 Updated last week
Alternatives and similar repositories for delta-sharing
Users who are interested in delta-sharing are comparing it to the libraries listed below.
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,360 Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆417 Updated 6 months ago
- An Open Standard for lineage metadata collection ☆2,178 Updated this week
- PyIceberg ☆913 Updated last week
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg ☆1,727 Updated this week
- Open Control Plane for Tables in Data Lakehouse ☆371 Updated last week
- This repository has moved into https://github.com/dbt-labs/dbt-adapters ☆442 Updated 3 months ago
- Collect, aggregate, and visualize a data ecosystem's metadata ☆2,058 Updated last week
- Egeria core ☆876 Updated last week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source. ☆346 Updated last year
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆251 Updated 2 months ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆1,025 Updated this week
- Data Lineage Tracking And Visualization Solution ☆647 Updated this week
- Generate and Visualize Data Lineage from query history ☆326 Updated 2 years ago
- Drop-in replacement for Apache Spark UI ☆341 Updated 2 weeks ago
- ☆269 Updated last year
- Apache DataFusion Comet Spark Accelerator ☆1,065 Updated this week
- Snowflake Data Source for Apache Spark. ☆230 Updated 3 weeks ago
- Python API for Deequ ☆803 Updated 7 months ago
- Dremio - the missing link in modern data ☆1,451 Updated last month
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆292 Updated this week
- 📙 Awesome Data Catalogs and Observability Platforms. ☆936 Updated 3 months ago
- Python client for Trino ☆405 Updated 2 months ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa… ☆795 Updated last week
- New Generation Opensource Data Stack Demo ☆449 Updated 2 years ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ☆2,222 Updated last week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆221 Updated last month
- Delta Lake helper methods in PySpark ☆323 Updated last year
- Template for a data contract used in a data mesh. ☆480 Updated last year
- PySpark test helper methods with beautiful error messages ☆724 Updated last month