This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics and machine learning workloads.
☆23Jan 16, 2024Updated 2 years ago
Alternatives and similar repositories for flink-iceberg-minio-trino
Users that are interested in flink-iceberg-minio-trino are comparing it to the libraries listed below
Sorting:
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset☆20Sep 29, 2025Updated 5 months ago
- ☆33Oct 22, 2022Updated 3 years ago
- ☆13Jun 10, 2024Updated last year
- Apache Hive Metastore in Standalone Mode With Docker☆14Jul 22, 2024Updated last year
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated 10 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆102Jan 31, 2023Updated 3 years ago
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi☆119Dec 15, 2023Updated 2 years ago
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- Introdução a processos de validação mais complexos do que o hold out básico (train/test), como validação cruzada, grupos etc.☆13Nov 26, 2018Updated 7 years ago
- ☆14Apr 20, 2018Updated 7 years ago
- ☆36Feb 22, 2023Updated 3 years ago
- Unity Catalog Explorer is a TypeScript + Next.js based Web UI for the Unity Catalog OSS.☆13Jun 29, 2024Updated last year
- Java implementation for performing operations on Apache Iceberg and Hive tables☆20Sep 17, 2025Updated 6 months ago
- Dameng Connector 是一个专门为达梦数据库(DM Database)设计的变更数据捕获(CDC)解决方案。该项 目通过扩展 Debezium 和 Flink CDC,实现了对达梦数据库的实时数据变更监控、捕获和处理能力,为数据集成、数据同步、实时分析等场景提供强…☆67Dec 24, 2025Updated 2 months ago
- ☆15Aug 16, 2017Updated 8 years ago
- Apache Ambari Web 中文汉化 2.7.x版本直接修改☆41Jan 2, 2023Updated 3 years ago
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated last month
- An open-source, community-driven REST catalog for Apache Iceberg!☆30Jun 26, 2024Updated last year
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆66Sep 23, 2023Updated 2 years ago
- 🌪️ AI research assistant that generates Wikipedia-quality articles through multi-perspective analysis. Based on Stanford's STORM methodo…☆53Jun 6, 2025Updated 9 months ago
- Lance Namespace is an open specification for describing access and operations against a collection of tables in a multimodal lakehouse☆51Updated this week
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Spark integrations for working with Lance datasets☆46Updated this week
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆84Apr 12, 2025Updated 11 months ago
- ☆18Feb 11, 2017Updated 9 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 3 months ago
- Groovy client library for Apache Ambari's REST API☆20Jun 25, 2021Updated 4 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- Asynchronous flink connector based on the Lettuce, supporting sql join and sink, query caching and debugging.☆261Apr 15, 2025Updated 11 months ago
- ☆22Aug 24, 2020Updated 5 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 4 months ago
- 一个实时数仓项目,从0到1搭建实时数仓☆63May 27, 2021Updated 4 years ago
- 本项目使用gin、gorm和ssh开发。提供完善的cmdb、批量执行、作业管理、基础设施管理等功能,帮助基础运维同学快速、低成本、可视化、自动化的运维平台项目☆31May 22, 2025Updated 9 months ago