josephmachado / change_data_capture
Repo for CDC with debezium blog post
☆25Updated this week
Related projects: ⓘ
- Code for my "Efficient Data Processing in SQL" book.☆47Updated last month
- Code for dbt tutorial☆138Updated 3 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆52Updated 5 months ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆51Updated last year
- Cost Efficient Data Pipelines with DuckDB☆42Updated last month
- Sample project to demonstrate data engineering best practices☆156Updated 6 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆47Updated 3 months ago
- A custom end-to-end data pipeline for customer churn☆9Updated this week
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆100Updated 2 months ago
- End to end data engineering project☆49Updated last year
- Simple stream processing pipeline☆89Updated 3 months ago
- Template for Data Engineering and Data Pipeline projects☆101Updated last year
- Code snippets for Data Engineering Design Patterns book☆27Updated this week
- Project for "Data pipeline design patterns" blog.☆41Updated last month
- build dw with dbt☆26Updated last month
- A simple and easy to use Data Quality (DQ) tool built with Python.☆45Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆54Updated last month
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆24Updated 7 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆68Updated last year
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆54Updated last year
- Code for "Efficient Data Processing in Spark" Course☆212Updated 3 months ago
- Quickstart for any service☆110Updated this week
- Data pipeline that scrapes Rust cheater Steam profiles☆50Updated 2 years ago
- Repo for saving cheat sheets☆42Updated 3 months ago
- Delta-Lake, ETL, Spark, Airflow☆42Updated last year
- Data Engineering examples covering Airflow and Mage for workflows; dbt for BigQuery, Redshift, ClickHouse; Spark and Kafka for Batch/Stre…☆50Updated 3 weeks ago
- Delta Lake Documentation☆45Updated 3 months ago
- ☆12Updated last month
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆122Updated 2 years ago
- ☆132Updated this week