Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
☆120Dec 15, 2023Updated 2 years ago
Alternatives and similar repositories for Real-time-Data-Warehouse
Users that are interested in Real-time-Data-Warehouse are comparing it to the libraries listed below.
- This project demonstrates real-time streaming of CDC data from MySQL to Apache Iceberg using Flink SQL Client for faster data analytics a…☆24Jan 16, 2024Updated 2 years ago
- ☆17Nov 26, 2024Updated last year
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12Apr 23, 2026Updated 2 weeks ago
- ☆11Nov 26, 2024Updated last year
- A collection of Apache Hudi demos to help beginners get started with Apache Hudi quickly☆74Sep 13, 2020Updated 5 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆66Sep 23, 2023Updated 2 years ago
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 11 months ago
- The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are c…☆913Jan 12, 2026Updated 3 months ago
- A real-time data warehouse project: building a real-time data warehouse from scratch☆63May 27, 2021Updated 4 years ago
- A repository used in a NiFi Registry demo☆13Mar 11, 2020Updated 6 years ago
- Examples of Flink on Azure☆54Oct 30, 2023Updated 2 years ago
- Kinesis Connector for Spark Structured Streaming☆10Dec 26, 2023Updated 2 years ago
- ☆176Sep 5, 2023Updated 2 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆65Sep 26, 2023Updated 2 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆12Nov 18, 2023Updated 2 years ago
- Simple Akka cluster example.☆12Mar 13, 2015Updated 11 years ago
- ☆16May 1, 2023Updated 3 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Self-contained demo using Flink SQL and Debezium to build a CDC-based analytics pipeline. All you need is Docker!☆26May 11, 2021Updated 4 years ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆312Apr 30, 2026Updated last week
- Apache Flink☆19Feb 8, 2023Updated 3 years ago
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆66Mar 9, 2024Updated 2 years ago
- Local-first GitHub dashboard for maintainers to triage, review, and merge PRs and issues across repos without needing GitHub's built-in n…☆81May 3, 2026Updated last week
- This project shows how to capture changes from a PostgreSQL database and stream them into Kafka☆42May 17, 2024Updated last year
- Flink Iceberg integration tests, with jobs running on YARN.☆37Apr 6, 2021Updated 5 years ago
- The Internals of Apache Kafka☆58Dec 19, 2023Updated 2 years ago
- A Flink-based web platform for real-time stream computing☆1,865Dec 2, 2025Updated 5 months ago
- An intelligent database for the AI era☆222Nov 9, 2023Updated 2 years ago
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆21Jul 26, 2024Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink☆100Mar 19, 2024Updated 2 years ago
- Repository containing Docker images for Spark master and slave☆15Nov 3, 2019Updated 6 years ago
- My Dota 2 Bot Script☆11Jun 6, 2022Updated 3 years ago
- A Docker Compose template that builds an interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive M…☆47Dec 19, 2024Updated last year
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆17May 23, 2024Updated last year
- A collection of Apache Hudi resources☆558Mar 31, 2026Updated last month
- Local AWS EMR - A local service that imitates AWS EMR☆27Jul 5, 2023Updated 2 years ago
- A data generator source connector for Flink SQL based on data-faker.☆236Jul 24, 2023Updated 2 years ago
- Example applications in Java, Python and SQL for Kinesis Data Analytics, demonstrating sources, sinks, and operators.☆147May 21, 2024Updated last year
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,130Updated this week