luatnc87 / open-source-modern-data-stackLinks
This repo demonstrate a comprehensive modern data stack using popular open-source tools.
☆30Updated last year
Alternatives and similar repositories for open-source-modern-data-stack
Users that are interested in open-source-modern-data-stack are comparing it to the libraries listed below
Sorting:
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆83Updated last year
- This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source to…☆30Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆149Updated this week
- ☆28Updated 4 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆35Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆42Updated 6 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆230Updated 3 months ago
- Fivetran's Shopify dbt package☆65Updated last week
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆4Updated last week
- Where the Meltano team runs Meltano! Get it???☆28Updated last month
- A map transformer which implements the `Stream Maps` capability from Meltano's tap and target SDK: https://sdk.meltano.com/☆18Updated this week
- ☆148Updated this week
- Repo for CDC with debezium blog post☆28Updated 8 months ago
- This repository is a production dbt pipeline example that model the profitability of an e-commerce business. Data is extracted and loaded…☆23Updated 11 months ago
- This repository contains examples of how to use dbt's metric functionality on the jaffle shop dataset☆29Updated 6 months ago
- Droughty helps keep your workflow dry☆66Updated this week
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆27Updated last year
- Contribute to dlt verified sources 🔥☆85Updated this week
- Data models for Hubspot built using dbt.☆35Updated last month
- New generation opensource data stack☆68Updated 3 years ago
- ☆203Updated 4 months ago
- Code for dbt tutorial☆157Updated last year
- Demo Project for Open Source MDS☆168Updated last week
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team …☆117Updated this week
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da…☆25Updated 2 years ago
- ☆37Updated 2 months ago
- Enables Python developers to leverage Debezium's CDC capabilities with custom event handlers and seamless integration.☆29Updated last month
- A monorepo of many Rill example projects☆37Updated last week
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆202Updated 3 weeks ago