izhangzhihao/Real-time-Data-Warehouse

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/izhangzhihao/Real-time-Data-Warehouse)

izhangzhihao / Real-time-Data-Warehouse

Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi

☆121

Alternatives and similar repositories for Real-time-Data-Warehouse

Users that are interested in Real-time-Data-Warehouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pranav1699 / flink-iceberg-minio-trino
View on GitHub
This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics a…
☆25Jan 16, 2024Updated 2 years ago
MartijnVisser / flink-only-sql
View on GitHub
Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …
☆12Updated this week
e6data / awesome-optimizing-iceberg-tables
View on GitHub
☆17Nov 26, 2024Updated last year
aws-samples / apache-xtable-on-aws-samples
View on GitHub
☆11Jun 8, 2026Updated last month
leesf / hudi-demos
View on GitHub
汇总Apache Hudi中的一些Demo，便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)
☆74Sep 13, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
1ambda / lakehouse
View on GitHub
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
☆69Sep 23, 2023Updated 2 years ago
GTyingzi / Flink_Demo
View on GitHub
这是一个Flink实时数仓项目
☆21Jul 28, 2022Updated 3 years ago
tj--- / iceberg-demo
View on GitHub
A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino
☆22May 30, 2022Updated 4 years ago
ververica / flink-sql-cookbook
View on GitHub
The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are c…
☆915Jan 12, 2026Updated 6 months ago
fengchi66 / realtime-dw
View on GitHub
一个实时数仓项目，从0到1搭建实时数仓
☆64May 27, 2021Updated 5 years ago
kevdoran / fdlc-demo
View on GitHub
A repository used in a NiFi Registry demo
☆13Mar 11, 2020Updated 6 years ago
adidas / datamesh-sharing-data-at-scale
View on GitHub
adidas Data Mesh implementation
☆12May 13, 2022Updated 4 years ago
roncemer / spark-sql-kinesis
View on GitHub
Kinesis Connector for Spark Structured Streaming
☆10Dec 26, 2023Updated 2 years ago
twalthr / flink-api-examples
View on GitHub
Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.
☆65Sep 26, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
fhueske / flink-sql-demo
View on GitHub
☆176Sep 5, 2023Updated 2 years ago
dcaoyuan / akka-cluster-example-inloop
View on GitHub
Simple akka cluster example.
☆12Mar 13, 2015Updated 11 years ago
apache / flink-connector-redis-streams
View on GitHub
Apache flink
☆19May 15, 2026Updated 2 months ago
morsapaes / flink-sql-CDC
View on GitHub
Self-contained demo using Flink SQL and Debezium to build a CDC-based analytics pipeline. All you need is Docker!
☆26May 11, 2021Updated 5 years ago
memiiso / debezium-server-iceberg
View on GitHub
Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake
☆324Updated this week
damavis / advanced-airflow
View on GitHub
Apache Airflow advanced functionalities examples
☆21Mar 22, 2024Updated 2 years ago
behnamyazdan / ecommerce_realtime_data_pipeline
View on GitHub
Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)
☆71Mar 9, 2024Updated 2 years ago
spancer / flink-iceberg-demo
View on GitHub
flink iceberg integration tests, jobs running on yarn.
☆37Apr 6, 2021Updated 5 years ago
airscholar / changecapture-e2e
View on GitHub
This project shows how to capture changes from postgres database and stream them into kafka
☆41May 17, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
akarce / e2e-structured-streaming
View on GitHub
End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…
☆21Jul 26, 2024Updated last year
lealone-plugins / QinSQL
View on GitHub
AI 时代的智能数据库
☆221Nov 9, 2023Updated 2 years ago
egehanyorulmaz / reference_etl
View on GitHub
Tutorial for easy-to-manage data pipelines with Airflow
☆10Jun 26, 2022Updated 4 years ago
japila-books / kafka-internals
View on GitHub
The Internals of Apache Kafka
☆59Dec 19, 2023Updated 2 years ago
getindata / dbt-flink-adapter
View on GitHub
Adapter for dbt that executes dbt pipelines on Apache Flink
☆102Mar 19, 2024Updated 2 years ago
IvanWoo / trino-on-kubernetes
View on GitHub
☆10May 5, 2022Updated 4 years ago
bartosz25 / spark-docker
View on GitHub
Repository containing Docker images for Spark master and slave
☆15Nov 3, 2019Updated 6 years ago
edge-blade / Dota-2-AI-Bot-Experiment
View on GitHub
My Dota 2 Bot Script
☆11Jun 6, 2022Updated 4 years ago
dynonguyen / Data-Warehouse-UKAccident
View on GitHub
Information system for business project - building and mining data warehouse
☆10Jan 11, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
cloudera / tutorial-assets
View on GitHub
Assets used in Cloudera Tutorials
☆19Nov 22, 2021Updated 4 years ago
behnamyazdan / DockerForDataEngineers
View on GitHub
This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…
☆16May 23, 2024Updated 2 years ago
1ambda / practical-data-pipeline
View on GitHub
Gitbook Repo for Practical Data Pipeline
☆25Feb 4, 2022Updated 4 years ago
davlum / localemr
View on GitHub
Local AWS EMR - A local service that imitates AWS EMR
☆27Jul 5, 2023Updated 3 years ago
aws-samples / amazon-kinesis-data-analytics-examples
View on GitHub
Example applications in Java, Python and SQL for Kinesis Data Analytics, demonstrating sources, sinks, and operators.
☆147May 21, 2024Updated 2 years ago
knaufk / flink-faker
View on GitHub
A data generator source connector for Flink SQL based on data-faker.
☆237Jul 24, 2023Updated 2 years ago
apache / amoro
View on GitHub
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
☆1,151Updated this week