delta-io / delta-sharingLinks
An open protocol for secure data sharing
☆861Updated last week
Alternatives and similar repositories for delta-sharing
Users that are interested in delta-sharing are comparing it to the libraries listed below
Sorting:
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,289Updated last week
- An Open Standard for lineage metadata collection☆2,086Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆411Updated 3 months ago
- Collect, aggregate, and visualize a data ecosystem's metadata☆1,996Updated last week
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,624Updated last week
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆441Updated last month
- Apache PyIceberg☆836Updated this week
- Python API for Deequ☆788Updated 4 months ago
- Egeria core☆859Updated last week
- Open Control Plane for Tables in Data Lakehouse☆366Updated last week
- Generate and Visualize Data Lineage from query history☆326Updated 2 years ago
- Data Lineage Tracking And Visualization Solution☆638Updated this week
- PySpark test helper methods with beautiful error messages☆713Updated 3 weeks ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆2,152Updated this week
- Delta Lake helper methods in PySpark☆325Updated 11 months ago
- CLI that makes it easy to create, test and deploy Airflow DAGs to Astronomer☆405Updated last week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆344Updated last year
- Apache DataFusion Comet Spark Accelerator☆1,028Updated this week
- Home of the Open Data Contract Standard (ODCS).☆526Updated last week
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.☆374Updated 3 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆189Updated 2 weeks ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆246Updated last month
- ☆267Updated 10 months ago
- Snowflake Data Source for Apache Spark.☆229Updated this week
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆259Updated 3 weeks ago
- Drop-in replacement for Apache Spark UI☆293Updated last week
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆427Updated 3 years ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆218Updated 3 weeks ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆279Updated this week
- Dremio - the missing link in modern data☆1,437Updated 3 months ago