agile-lab-dev / Data-Product-Specification
An open specification for data products in Data Mesh
☆55Updated 2 months ago
Alternatives and similar repositories for Data-Product-Specification:
Users that are interested in Data-Product-Specification are comparing it to the libraries listed below
- Home of the Open Data Contract Standard (ODCS).☆432Updated last week
- A Python Library to support running data quality rules while the spark job is running⚡☆168Updated last week
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆74Updated this week
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- ☆16Updated 5 months ago
- The Data Product Descriptor Specification (DPDS) Repository☆76Updated 2 weeks ago
- Yet Another (Spark) ETL Framework☆18Updated last year
- Weekly Data Engineering Newsletter☆94Updated 6 months ago
- Library to convert DBT manifest metadata to Airflow tasks☆47Updated 10 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆192Updated this week
- Template for a data contract used in a data mesh.☆467Updated 10 months ago
- The Data Contract Specification Repository☆305Updated last week
- ☆73Updated 3 months ago
- Sample configuration to deploy a modern data platform.☆87Updated 3 years ago
- Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including…☆147Updated this week
- Adapter for dbt that executes dbt pipelines on Apache Flink☆90Updated 10 months ago
- Delta Lake helper methods in PySpark☆315Updated 4 months ago
- Delta Lake Documentation☆48Updated 7 months ago
- Data Tools Subjective List☆82Updated last year
- Data product portal created by Dataminded☆172Updated this week
- Delta lake and filesystem helper methods☆50Updated 11 months ago
- ☆94Updated last year
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆77Updated this week
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆71Updated 3 years ago
- Quick Guides from Dremio on Several topics☆67Updated last week
- Schema modelling framework for decentralised domain-driven ownership of data.☆249Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆63Updated 4 months ago
- Define, govern, and model event data for warehouse-first product analytics.☆82Updated 7 months ago
- The go to demo for public and private dbt Learn☆74Updated 4 months ago
- A Table format agnostic data sharing framework☆38Updated 11 months ago