limadelrey / kafka-connect-cdc-medium
Kafka Connect: How to create a real time data pipeline using Change Data Capture (CDC)
☆13Updated 4 years ago
Alternatives and similar repositories for kafka-connect-cdc-medium:
Users that are interested in kafka-connect-cdc-medium are comparing it to the libraries listed below
- Docker envinroment to stream data from Kafka to Iceberg tables☆26Updated last year
- Sample code that shows the important aspects of developing custom connectors for Kafka Connect. It provides the resources for building, d…☆54Updated 9 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆43Updated 2 years ago
- ☆45Updated 4 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated last week
- ☆16Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆25Updated 7 months ago
- Source code for the YouTube video, Apache Beam Explained in 12 Minutes☆21Updated 4 years ago
- This repository contains the components that I use for my Youtube Kafka videos☆32Updated last year
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆19Updated 2 years ago
- This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to Bigquery. Ther…☆21Updated this week
- Yet Another (Spark) ETL Framework☆20Updated last year
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- ☆17Updated 2 years ago
- Presto Trino with Apache Hive Postgres metastore☆40Updated 6 months ago
- Generative AI in realtime with Confluent Cloud.☆22Updated 11 months ago
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes☆23Updated 5 months ago
- ☆53Updated 7 months ago
- CDC with NiFi, Kafka Connect, Flink SQL, Cloudera Data in Motion☆12Updated last year
- Data Engineering with Scala, published by Packt☆23Updated last year
- dbt package for monitoring airflow DAGs and tasks☆29Updated last month
- Building Big Data Pipelines with Apache Beam, published by Packt☆86Updated 2 years ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆19Updated this week
- This repository contains the source code for samples featured in eventdrivenutopia.com☆46Updated 2 years ago
- Amazon EMR Serverless and Amazon MSK Serverless Demo☆13Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆50Updated 4 years ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Updated 2 years ago
- Effective Kafka☆52Updated 2 years ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆85Updated 11 months ago
- Apache Flink/Apache Kafka streaming data analytics demonstration using Streaming Synthetic Sales Data Generator☆12Updated 9 months ago