A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset
☆20Sep 29, 2025Updated 5 months ago
Alternatives and similar repositories for apache_flink_and_iceberg
Users that are interested in apache_flink_and_iceberg are comparing it to the libraries listed below
Sorting:
- This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics a…☆23Jan 16, 2024Updated 2 years ago
- Apache iceberg Spark s3 examples☆21Mar 1, 2024Updated 2 years ago
- Repository for the dbt Semantic Layer course☆12Updated this week
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated 2 weeks ago
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- Material for a course on applied machine-learning for scientists. Taught at EPFL in spring 2018.☆11May 3, 2018Updated 7 years ago
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- This sample illustrates using data compression with AWS Lambda functions☆11Apr 10, 2025Updated 10 months ago
- Spring Security training☆11Jun 9, 2024Updated last year
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 3 months ago
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated 10 months ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- High throughput streaming of Protobuf data from Kafka into DuckDB☆12Updated this week
- ☆14Aug 21, 2021Updated 4 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 3 months ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Iceberg Playground in a Box☆67Jun 27, 2025Updated 8 months ago
- Website for jbang.dev☆13Feb 18, 2026Updated 2 weeks ago
- JAIG = Java AI-powered Code Generator☆14Oct 10, 2024Updated last year
- Apache Hive Metastore in Standalone Mode With Docker☆14Jul 22, 2024Updated last year
- Wrapper for "Chatterbot Eliza 2.0" made by Gonzales Cenelia. With is a C++ implementation of this well know chatbot☆13Sep 23, 2012Updated 13 years ago
- A Quarkus CLI that can merge PDF iles☆12Sep 13, 2023Updated 2 years ago
- Template for Demos with Apache Spark, Dremio, Minio and Nessie☆13Sep 28, 2024Updated last year
- Homework for the assignments of the book Exercises in Programming Style☆10Sep 7, 2019Updated 6 years ago
- A Trino ODBC driver☆14Jan 10, 2024Updated 2 years ago
- ☆15Dec 11, 2023Updated 2 years ago
- Slightly Streamlined AWS Cloud Development Kit (CDK) Boilerplate for Java☆14Nov 17, 2025Updated 3 months ago
- Arrow-Powered Data Exchange☆15Feb 7, 2025Updated last year
- ☆16Jul 26, 2018Updated 7 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- JBang examples to get started with Pi4J V2☆10Feb 20, 2026Updated 2 weeks ago
- Step-by-step guide to curl operations☆18Aug 17, 2025Updated 6 months ago
- End-to-end data platform leveraging the Modern data stack☆52Apr 10, 2024Updated last year
- Quarkus Amazon CloudWatch Logging extension☆16Updated this week