delta-io / delta-sharing
An open protocol for secure data sharing
☆833 Updated this week
Alternatives and similar repositories for delta-sharing
Users who are interested in delta-sharing are comparing it to the libraries listed below.
- An Open Standard for lineage metadata collection ☆1,931 Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,202 Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆397 Updated last week
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks ☆428 Updated 3 months ago
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg ☆1,474 Updated this week
- Open Control Plane for Tables in Data Lakehouse ☆348 Updated last week
- Python API for Deequ ☆768 Updated last month
- Apache PyIceberg ☆714 Updated this week
- Apache DataFusion Comet Spark Accelerator ☆944 Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata ☆1,909 Updated this week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ☆2,083 Updated this week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa… ☆749 Updated this week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source. ☆345 Updated 11 months ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆632 Updated this week
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads. ☆425 Updated 3 years ago
- Open, Multi-modal Catalog for Data & AI ☆2,833 Updated 2 weeks ago
- Dremio - the missing link in modern data ☆1,427 Updated last week
- Template for a data contract used in a data mesh. ☆472 Updated last year
- Data Lineage Tracking And Visualization Solution ☆623 Updated last week
- 📙 Awesome Data Catalogs and Observability Platforms. ☆845 Updated 3 weeks ago
- Performance Observability for Apache Spark ☆248 Updated last month
- Generate and Visualize Data Lineage from query history ☆324 Updated last year
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org) ☆1,070 Updated last week
- Turning PySpark Into a Universal DataFrame API ☆391 Updated this week
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. ☆368 Updated last week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow ☆215 Updated this week
- PySpark test helper methods with beautiful error messages ☆688 Updated 3 weeks ago
- Snowflake Data Source for Apache Spark. ☆225 Updated last month
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆233 Updated last month
- pyspark methods to enhance developer productivity 📣 👯 🎉 ☆669 Updated 2 months ago