Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
☆120Dec 15, 2023Updated 2 years ago
Alternatives and similar repositories for Real-time-Data-Warehouse
Users that are interested in Real-time-Data-Warehouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12Updated this week
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆74Sep 13, 2020Updated 5 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆69Sep 23, 2023Updated 2 years ago
- A custom end-to-end analytics platform for customer churn☆10May 15, 2025Updated last year
- 这是一个Flink实时数仓项目☆21Jul 28, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are c…☆914Jan 12, 2026Updated 5 months ago
- A repository used in a NiFi Registry demo☆13Mar 11, 2020Updated 6 years ago
- Examples of Flink on Azure☆54Oct 30, 2023Updated 2 years ago
- Kinesis Connector for Spark Structured Streaming☆10Dec 26, 2023Updated 2 years ago
- ☆176Sep 5, 2023Updated 2 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆65Sep 26, 2023Updated 2 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆12Nov 18, 2023Updated 2 years ago
- ☆16May 1, 2023Updated 3 years ago
- Self-contained demo using Flink SQL and Debezium to build a CDC-based analytics pipeline. All you need is Docker!☆26May 11, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Apache flink☆19May 15, 2026Updated last month
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆69Mar 9, 2024Updated 2 years ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- CDP examples and tutorials☆19Jun 10, 2025Updated last year
- This project shows how to capture changes from postgres database and stream them into kafka☆42May 17, 2024Updated 2 years ago
- flink iceberg integration tests, jobs running on yarn.☆37Apr 6, 2021Updated 5 years ago
- The Internals of Apache Kafka☆59Dec 19, 2023Updated 2 years ago
- 基于flink的实时流计算web平台☆1,859Dec 2, 2025Updated 6 months ago
- AI 时代的智能数据库☆221Nov 9, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Javascript library to talk to multiple OLAP backends from multiple frontends☆17Feb 4, 2013Updated 13 years ago
- Adapter for dbt that executes dbt pipelines on Apache Flink☆100Mar 19, 2024Updated 2 years ago
- A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive M…☆47Dec 19, 2024Updated last year
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆17May 23, 2024Updated 2 years ago
- 汇总Apache Hudi相关资料☆558Mar 31, 2026Updated 3 months ago
- Local AWS EMR - A local service that imitates AWS EMR☆27Jul 5, 2023Updated 2 years ago
- A data generator source connector for Flink SQL based on data-faker.☆238Jul 24, 2023Updated 2 years ago
- Example applications in Java, Python and SQL for Kinesis Data Analytics, demonstrating sources, sinks, and operators.☆147May 21, 2024Updated 2 years ago
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,145Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆116Apr 21, 2023Updated 3 years ago
- Assets used in Cloudera Tutorials☆19Nov 22, 2021Updated 4 years ago
- Qt 无线电监控软件☆13Dec 25, 2019Updated 6 years ago
- Instant access to the Spark cluster from anywhere☆16Nov 10, 2020Updated 5 years ago
- Playground for Flink Table Store with use cases and performance features☆51Apr 18, 2023Updated 3 years ago
- This project provides a reverse proxy for Spark UI on Kubernetes☆16Oct 12, 2023Updated 2 years ago
- Upserts, Deletes And Incremental Processing on Big Data.☆6,180Updated this week