This project shows how to capture changes from postgres database and stream them into kafka
☆42May 17, 2024Updated 2 years ago
Alternatives and similar repositories for changecapture-e2e
Users that are interested in changecapture-e2e are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆12Nov 18, 2023Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆48Dec 11, 2023Updated 2 years ago
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…☆11Oct 11, 2023Updated 2 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆14Dec 27, 2023Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆45Jan 4, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python…☆48Mar 14, 2024Updated 2 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆329Feb 14, 2025Updated last year
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆51Dec 4, 2023Updated 2 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆20Apr 25, 2024Updated 2 years ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆215Oct 23, 2023Updated 2 years ago
- An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Az…☆30Oct 2, 2023Updated 2 years ago
- Data Engineering Bootcamp☆31Aug 5, 2025Updated 10 months ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An end-to-end, containerized data pipeline for near-real-time user event analytics using Kafka, ClickHouse, Airflow, and PySpark. Made to…☆79Sep 12, 2025Updated 8 months ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- Building a Data Pipeline with an Open Source Stack☆59Jun 27, 2025Updated 11 months ago
- Data Engineering with Scala, published by Packt☆28Apr 22, 2026Updated last month
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl…☆18Dec 26, 2023Updated 2 years ago
- Repository to host micro service implementation patterns.☆14Jun 25, 2025Updated 11 months ago
- This is an end to end MLOps system☆34Nov 27, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repository showcases a collection of machine learning projects in various domains, demonstrating my skills and expertise as a data s…☆12Nov 20, 2023Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆146Jul 27, 2023Updated 2 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Aug 14, 2023Updated 2 years ago
- ☆30Nov 16, 2023Updated 2 years ago
- This a is a simple fortune teller like app which tells what 2 people are, it does this based on the letters in both names. The given answ…☆12Mar 15, 2021Updated 5 years ago
- [ESWC '24] This repo is official implementation for the paper "Towards Harnessing Large Language Models as Autonomous Agents for Semantic…☆10May 25, 2024Updated 2 years ago
- GPT-4o Powered Calorie Detecor☆18May 29, 2024Updated 2 years ago
- Practice course on Big Data☆18May 16, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Analyzing Video Assistant Referee (VAR) decisions in the English Premier League (2019 - 2021)☆12Jun 1, 2021Updated 5 years ago
- This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, …☆35Mar 12, 2026Updated 3 months ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- Superstore Sales with Streamlit is a data visualization and analysis project that uses the Streamlit framework to create an interactive w…☆24Aug 24, 2023Updated 2 years ago
- Transparent sandbox for integration testing against AWS services. Test your infrastructure without changes to your Terraform files or you…☆12Oct 26, 2023Updated 2 years ago
- A platform that helps developers to better understand CSS through declaration interpretation and may even improve them through suggestion…☆14Jul 3, 2021Updated 4 years ago
- This repository has moved into https://github.com/dbt-labs/dbt-adapters☆52Feb 7, 2025Updated last year