leartbeqiraj1 / cdc-postgresql-clickhouseLinks
This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a ClickHouse database through Apache Kafka. Using Debezium to capture and stream database changes, and Kafka Connect to sink the data into ClickHouse, this pipeline allows for efficient data synchronization.
☆12Updated last year
Alternatives and similar repositories for cdc-postgresql-clickhouse
Users that are interested in cdc-postgresql-clickhouse are comparing it to the libraries listed below
Sorting:
- dlt-dagster-demo☆11Updated last year
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆20Updated 3 years ago
- resources for trying out a nessie-flink-iceberg setup☆11Updated last year
- Docker envinroment to stream data from Kafka to Iceberg tables☆29Updated last year
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes☆28Updated 8 months ago
- This repository contains recipes for Apache Pinot.☆30Updated 3 months ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Updated last year
- This is where to start the data transformation with dbt and PostgreSQL☆8Updated 3 years ago
- dbt package for monitoring airflow DAGs and tasks☆29Updated 3 months ago
- Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services☆28Updated 2 months ago
- Apache flink☆13Updated 6 months ago
- A leightweight UI for Lakekeeper☆12Updated this week
- Connectors for capturing data from external data sources☆65Updated this week
- Build Data Lake using Open Source tools☆100Updated last week
- A modern, enterprise-ready business intelligence web application. Unleash the value of your data. 📈 📉 📊☆33Updated 2 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆9Updated last year
- Trino On K8S Via Helm & Metastore Workshop Querying Delta Tables☆9Updated 4 months ago
- Schema Registry Statistics Tool☆24Updated last week
- Analytics engineering with dbt - projects and developer environment☆18Updated 8 months ago
- Collection of assets used for various articles at https://blogs.min.io☆38Updated 2 months ago
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆63Updated this week
- Deploy multiple Dagster data pipelines on Docker environment☆22Updated last year
- Superset with keycloak integration using OpenID Connect☆30Updated last year
- A repository to store recipes, custom sources, transformations and other things to make your DataHub experience magical☆12Updated 2 years ago
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi…☆46Updated 3 years ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- A RESTful schema registry☆13Updated 4 months ago
- A simple composable rule engine, built in object-oriented way, to reduce manual work.☆17Updated last year
- A free, open-source, web-based self-service BI tailor-made for clickhouse, google bigquery, mysql, postgresql, vertica☆112Updated 6 months ago
- ☆10Updated 2 years ago