Repo for CDC with debezium blog post
☆30 · Sep 15, 2024 · Updated last year
Alternatives and similar repositories for change_data_capture
Users that are interested in change_data_capture are comparing it to the libraries listed below.
- Cost Efficient Data Pipelines with DuckDB ☆61 · May 14, 2025 · Updated last year
- A custom end-to-end analytics platform for customer churn ☆10 · May 15, 2025 · Updated last year
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow ☆34 · Oct 18, 2020 · Updated 5 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆109 · May 26, 2026 · Updated last month
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆13 · May 24, 2024 · Updated 2 years ago
- Code to demonstrate data engineering metadata & logging best practices ☆21 · Mar 12, 2024 · Updated 2 years ago
- Step by step instructions to create a production-ready data pipeline ☆62 · Dec 23, 2024 · Updated last year
- ☆14 · Dec 11, 2023 · Updated 2 years ago
- Simple stream processing pipeline ☆112 · Jun 17, 2024 · Updated 2 years ago
- Sample Project to Learn Data Engineering ☆10 · Aug 1, 2021 · Updated 4 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆63 · Aug 6, 2024 · Updated last year
- End to end data engineering project ☆59 · Oct 27, 2022 · Updated 3 years ago
- Primary repository for NYC DCP's Data Engineering team ☆42 · Jun 26, 2026 · Updated last week
- Repo containing all of my Data engineering projects ☆14 · May 4, 2025 · Updated last year
- Using Debezium to capture data changes from databases and populate these as historic evolution and table replication in Snowflake ☆24 · Oct 26, 2023 · Updated 2 years ago
- Extension for DuckDB for functions that require the Apache Arrow dependency ☆46 · May 12, 2025 · Updated last year
- DWH powered by Clickhouse and dbt ☆13 · Aug 4, 2024 · Updated last year
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on. ☆25 · Apr 27, 2023 · Updated 3 years ago
- A Datasource provider based on DuckDB for analytics/pivot tables ☆24 · Feb 23, 2025 · Updated last year
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing ☆295 · Jul 11, 2024 · Updated last year
- An incubating Debezium CDC connector for IBM i (AS/400). Please log issues at https://github.com/debezium/dbz/issues. ☆20 · Jun 23, 2026 · Updated last week
- Code for dbt tutorial ☆181 · Jun 4, 2026 · Updated last month
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks ☆24 · Apr 2, 2022 · Updated 4 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆40 · Apr 29, 2024 · Updated 2 years ago
- Consume debezium events to databend ☆21 · Apr 7, 2024 · Updated 2 years ago
- Demo of orchestrating Airbyte connections with Prefect ☆11 · Mar 3, 2022 · Updated 4 years ago
- Analytics engineering with dbt - projects and developer environment ☆22 · Sep 27, 2024 · Updated last year
- Experimental ClickHouse Native Client and Native file reader Extension for DuckDB chsql ☆20 · Feb 18, 2026 · Updated 4 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆32 · Updated this week
- PostgreSQL DB2 Data Wrapper ☆29 · Jun 3, 2026 · Updated last month
- Visualizations and predictive analytics which uses statsbomb and other data sources to visualize metrics like shot location, through ball… ☆11 · May 17, 2021 · Updated 5 years ago
- Running ClickHouse like it's BigQuery ☆39 · Aug 24, 2023 · Updated 2 years ago
- VSCode extension to bring Kestra's autocompletion to your IDE ☆14 · Updated this week
- Replicache Diff Server ☆14 · Mar 8, 2021 · Updated 5 years ago
- Zod 4 to Zod 3 runtime converter ☆18 · Sep 5, 2025 · Updated 9 months ago
- dbt-prql allows writing PRQL in dbt models ☆108 · May 26, 2026 · Updated last month
- Apache arrow examples in golang ☆15 · Apr 27, 2021 · Updated 5 years ago
- Slipstream provides a data-flow model to simplify development of stateful streaming applications. ☆39 · Feb 19, 2026 · Updated 4 months ago
- Web based playground for Apache DataFusion via WASM ☆20 · May 18, 2025 · Updated last year