delta-io / delta-sharingLinks
An open protocol for secure data sharing
☆848Updated 3 weeks ago
Alternatives and similar repositories for delta-sharing
Users that are interested in delta-sharing are comparing it to the libraries listed below
Sorting:
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,252Updated this week
- An Open Standard for lineage metadata collection☆2,013Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆408Updated 2 months ago
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,580Updated this week
- Open Control Plane for Tables in Data Lakehouse☆359Updated this week
- Apache PyIceberg☆799Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata☆1,949Updated last week
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆436Updated 5 months ago
- Generate and Visualize Data Lineage from query history☆326Updated last year
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆2,136Updated this week
- Egeria core☆854Updated last week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆344Updated last year
- Python API for Deequ☆784Updated 3 months ago
- Data Lineage Tracking And Visualization Solution☆636Updated this week
- ☆266Updated 8 months ago
- Apache DataFusion Comet Spark Accelerator☆988Updated this week
- Dremio - the missing link in modern data☆1,436Updated 2 months ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆779Updated this week
- Delta Lake helper methods in PySpark☆324Updated 10 months ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆425Updated 3 years ago
- Open, Multi-modal Catalog for Data & AI☆2,983Updated 3 weeks ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆255Updated last week
- Home of the Open Data Contract Standard (ODCS).☆510Updated last month
- Drop-in replacement for Apache Spark UI☆273Updated last week
- Snowflake Data Source for Apache Spark.☆226Updated 3 weeks ago
- Python client for Trino☆383Updated 3 weeks ago
- Template for a data contract used in a data mesh.☆471Updated last year
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆242Updated this week
- 📙 Awesome Data Catalogs and Observability Platforms.☆870Updated 2 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆188Updated this week