agile-lab-dev / Data-Product-SpecificationLinks
An open specification for data products in Data Mesh
☆63Updated last month
Alternatives and similar repositories for Data-Product-Specification
Users that are interested in Data-Product-Specification are comparing it to the libraries listed below
Sorting:
- The Data Contract Specification Repository☆382Updated last month
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆180Updated last year
- Data product portal created by Dataminded☆192Updated this week
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆268Updated 3 weeks ago
- The Data Product Descriptor Specification (DPDS) Repository☆81Updated 9 months ago
- Yet Another (Spark) ETL Framework☆21Updated 2 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Delta Lake examples☆230Updated last year
- Home of the Open Data Contract Standard (ODCS).☆570Updated last week
- lakefs-samples repository☆86Updated 3 weeks ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆259Updated last year
- Template for a data contract used in a data mesh.☆476Updated last year
- A Table format agnostic data sharing framework☆40Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆165Updated last month
- Data Mesh Architecture☆82Updated last week
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated last week
- ☆97Updated 2 years ago
- Delta Lake Documentation☆50Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆220Updated 3 weeks ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆64Updated 3 years ago
- ☆35Updated last month
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆46Updated 9 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆190Updated this week
- Data Tools Subjective List☆86Updated 2 years ago
- Unity Catalog UI☆43Updated last year
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team …☆128Updated 3 weeks ago
- Delta Lake helper methods in PySpark☆323Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆168Updated last week
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- The Picnic Data Vault framework.☆130Updated last year