Repo for CDC with debezium blog post
☆29Sep 15, 2024Updated last year
Alternatives and similar repositories for change_data_capture
Users that are interested in change_data_capture are comparing it to the libraries listed below.
- Cost Efficient Data Pipelines with DuckDB☆63May 14, 2025Updated 11 months ago
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 11 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆104Jun 7, 2024Updated last year
- Repository for Data Engineering Interview Series☆37Oct 17, 2024Updated last year
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated 2 years ago
- Simple stream processing pipeline☆112Jun 17, 2024Updated last year
- Code for my "Efficient Data Processing in SQL" book.☆62Aug 6, 2024Updated last year
- End to end data engineering project☆58Oct 27, 2022Updated 3 years ago
- Project for "Data pipeline design patterns" blog.☆51Aug 6, 2024Updated last year
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- Code for data quality with greatexpectations blog☆13Jul 30, 2024Updated last year
- Sample repo for startdataengineering DE 101 free course☆74Jun 24, 2024Updated last year
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆24Apr 27, 2023Updated 3 years ago
- A Datasource provider based on DuckDB for analytics/pivot tables☆24Feb 23, 2025Updated last year
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆291Jul 11, 2024Updated last year
- An incubating Debezium CDC connector for IBM i (AS/400). Please log issues at https://github.com/debezium/dbz/issues.☆19Apr 22, 2026Updated last week
- Code for dbt tutorial☆174Sep 9, 2025Updated 7 months ago
- A set of functions to visualize college football teams in 'ggplot2'☆11Nov 29, 2023Updated 2 years ago
- iThome 13th-ironman (2021) - Data Science Learning Roadmap about Python☆16Mar 9, 2022Updated 4 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 4 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Apr 29, 2024Updated 2 years ago
- AI applications that take effect in a single dose☆14Nov 21, 2019Updated 6 years ago
- Analytics engineering with dbt - projects and developer environment☆22Sep 27, 2024Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆31Apr 27, 2026Updated last week
- PostgreSQL DB2 Data Wrapper☆29Apr 19, 2026Updated 2 weeks ago
- Beginner data engineering project - batch edition☆581Apr 13, 2026Updated 3 weeks ago
- This project leverages Hadoop, Spark, SQL, and Hive for efficient data integration, transformation, warehousing, and analytics. It provid…☆23Sep 30, 2023Updated 2 years ago
- Replicache Diff Server☆14Mar 8, 2021Updated 5 years ago
- dbt-prql allows writing PRQL in dbt models☆108Apr 27, 2026Updated last week
- Apache arrow examples in golang☆15Apr 27, 2021Updated 5 years ago
- Pub-Sub between PostgreSQL and Redis in Python☆11May 22, 2023Updated 2 years ago
- Slipstream provides a data-flow model to simplify development of stateful streaming applications.☆39Feb 19, 2026Updated 2 months ago
- Plumbing, an alternative to subclassing☆15Feb 3, 2026Updated 3 months ago
- create dynamic pipelines on github workflows☆18Apr 16, 2026Updated 2 weeks ago
- Explore the dbt Semantic Layer☆31May 26, 2025Updated 11 months ago
- Generate massive fake datasets for your datalake, fast. By SOMA☆20Apr 17, 2026Updated 2 weeks ago