This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics and machine learning workloads.
☆24Jan 16, 2024Updated 2 years ago
Alternatives and similar repositories for flink-iceberg-minio-trino
Users that are interested in flink-iceberg-minio-trino are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset☆25Sep 29, 2025Updated 7 months ago
- Iceberg开发指南,集成数据湖Iceberg在Spark、Flink引擎的等使用示例☆13Oct 8, 2022Updated 3 years ago
- ☆13Jun 10, 2024Updated last year
- Apache Hive Metastore in Standalone Mode With Docker☆14Jul 22, 2024Updated last year
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆101Jan 31, 2023Updated 3 years ago
- This repo is an approach to TDD in machine learning model operation. it covers project structure, testing essentials using pytest with Gi…☆15Dec 2, 2020Updated 5 years ago
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi☆120Dec 15, 2023Updated 2 years ago
- Demo from NEO4j's Connections: Healthcare & Life Sciences event☆12Jun 30, 2020Updated 5 years ago
- The home of Floecat: A catalog of catalogs for open table formats☆81Updated this week
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- Postgresql configured to work as metastore for Hive.☆32Dec 16, 2022Updated 3 years ago
- ☆14Apr 20, 2018Updated 8 years ago
- ☆36Feb 22, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Oozie - workflow engine for Hadoop☆17Jul 8, 2020Updated 5 years ago
- zsxq api☆12Jul 8, 2023Updated 2 years ago
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- ☆15Aug 16, 2017Updated 8 years ago
- Apache Ambari Web 中文汉化 2.7.x版本直接修改☆41Jan 2, 2023Updated 3 years ago
- Repository for the dbt Semantic Layer course☆14May 12, 2026Updated last week
- It's a Home page of saas base website, that content dashboard and subscription plan with crazy animation and layout.☆12Jul 1, 2024Updated last year
- An open-source, community-driven REST catalog for Apache Iceberg!☆30Jun 26, 2024Updated last year
- 🌪️ AI research assistant that generates Wikipedia-quality articles through multi-perspective analysis. Based on Stanford's STORM methodo…☆61Jun 6, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆66Sep 23, 2023Updated 2 years ago
- Lance Namespace is an open specification for describing access and operations against a collection of tables in a multimodal lakehouse☆53Updated this week
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Spark integrations for working with Lance datasets☆48Updated this week
- The Internals of Apache Kafka☆59Dec 19, 2023Updated 2 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- ☆30Dec 4, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆18Feb 11, 2017Updated 9 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆85Apr 12, 2025Updated last year
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 5 months ago
- Groovy client library for Apache Ambari's REST API☆20Jun 25, 2021Updated 4 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- Asynchronous flink connector based on the Lettuce, supporting sql join and sink, query caching and debugging.☆262Apr 15, 2025Updated last year