JacekMajchrzak / awesome-datameshLinks
☆96Updated last year
Alternatives and similar repositories for awesome-datamesh
Users that are interested in awesome-datamesh are comparing it to the libraries listed below
Sorting:
- An open specification for data products in Data Mesh☆59Updated 6 months ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Sample Airflow DAGs☆62Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated last year
- A repository of sample code to accompany our blog post on Airflow and dbt.☆173Updated last year
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated this week
- Witboost is a versatile platform that addresses a wide range of sophisticated data engineering challenges. The Starter Kit showcases the …☆21Updated this week
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆75Updated 3 years ago
- The go to demo for public and private dbt Learn☆77Updated 2 months ago
- Weekly Data Engineering Newsletter☆95Updated 10 months ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- dbt-github-workflow is a boilerplate that contains all the necessary configurations to set up a simple CI/CD pipeline for your data model…☆17Updated 3 years ago
- ☆80Updated 7 months ago
- re_data - fix data issues before your users & CEO would discover them 😊☆98Updated last year
- Rules based grant management for Snowflake☆40Updated 6 years ago
- Data pipeline with dbt, Airflow, Great Expectations☆163Updated 3 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated last week
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da…☆112Updated last year
- Data Mesh Architecture☆78Updated 11 months ago
- Data Tools Subjective List☆83Updated last year
- Creates simple data models on Snowflake to report dbt source freshness and tests☆26Updated last year
- Utility functions for dbt projects running on Spark☆34Updated 3 months ago
- Great Expectations Airflow operator☆165Updated this week
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- Demo project for dbt on Databricks☆32Updated 4 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- Big Data Demystified meetup and blog examples☆31Updated 9 months ago