Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
☆119Dec 15, 2023Updated 2 years ago
Alternatives and similar repositories for Real-time-Data-Warehouse
Users that are interested in Real-time-Data-Warehouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics a…☆23Jan 16, 2024Updated 2 years ago
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12Updated this week
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆74Sep 13, 2020Updated 5 years ago
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 10 months ago
- 这是一个Flink实时数仓项目☆21Jul 28, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 基于RED5流媒体服务器+ckplay实现的在线直播、视频☆14Mar 27, 2016Updated 10 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆22May 30, 2022Updated 3 years ago
- The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are c…☆912Jan 12, 2026Updated 2 months ago
- 一个实时数仓项目,从0到1搭建实时数仓☆63May 27, 2021Updated 4 years ago
- adidas Data Mesh implementation☆12May 13, 2022Updated 3 years ago
- ☆175Sep 5, 2023Updated 2 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆65Sep 26, 2023Updated 2 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆11Nov 18, 2023Updated 2 years ago
- Simple akka cluster example.☆12Mar 13, 2015Updated 11 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Self-contained demo using Flink SQL and Debezium to build a CDC-based analytics pipeline. All you need is Docker!☆26May 11, 2021Updated 4 years ago
- Apache flink☆18Feb 8, 2023Updated 3 years ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- CDP examples and tutorials☆19Jun 10, 2025Updated 9 months ago
- The code for computer science☆37Oct 8, 2024Updated last year
- This project shows how to capture changes from postgres database and stream them into kafka☆42May 17, 2024Updated last year
- flink iceberg integration tests, jobs running on yarn.☆37Apr 6, 2021Updated 4 years ago
- 基于flink的实时流计算web平台☆1,866Dec 2, 2025Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 基于Mysql协议的数据库中间件,支持SQL查询Mysql、Oracle、Clickhouse、Excel、elasticsearch,基于calcite进行SQL解析,并对SQL进行扩展☆14Aug 5, 2024Updated last year
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆21Jul 26, 2024Updated last year
- Repository containing Docker images for Spark master and slave☆15Nov 3, 2019Updated 6 years ago
- flink-connector-opengauss (unofficial)☆10Sep 25, 2021Updated 4 years ago
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆17May 23, 2024Updated last year
- 汇总Apache Hudi相关资料☆558Jan 4, 2026Updated 2 months ago
- A data generator source connector for Flink SQL based on data-faker.☆235Jul 24, 2023Updated 2 years ago
- Example applications in Java, Python and SQL for Kinesis Data Analytics, demonstrating sources, sinks, and operators.☆147May 21, 2024Updated last year
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,122Updated this week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆118Apr 21, 2023Updated 2 years ago
- Instant access to the Spark cluster from anywhere☆16Nov 10, 2020Updated 5 years ago
- Playground for Flink Table Store with use cases and performance features☆51Apr 18, 2023Updated 2 years ago
- This project provides a reverse proxy for Spark UI on Kubernetes☆17Oct 12, 2023Updated 2 years ago
- Upserts, Deletes And Incremental Processing on Big Data.☆6,126Updated this week
- duckdb-etl-framework☆15Dec 20, 2024Updated last year
- Visualize column-level data lineage in Spark SQL☆92May 13, 2022Updated 3 years ago