1ambda / lakehouseView external linksLinks
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
☆65Sep 23, 2023Updated 2 years ago
Alternatives and similar repositories for lakehouse
Users that are interested in lakehouse are comparing it to the libraries listed below
Sorting:
- Proof of concept of a big data cluster using open source tools☆11Apr 10, 2024Updated last year
- Gitbook Repo for Practical Data Pipeline☆25Feb 4, 2022Updated 4 years ago
- Run an open-source data LakeHouse locally using Docker Compose☆12May 31, 2024Updated last year
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆75Sep 2, 2023Updated 2 years ago
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi☆119Dec 15, 2023Updated 2 years ago
- domain driven design in Go☆14Aug 18, 2020Updated 5 years ago
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 9 months ago
- Inference API server with echo and gRPC to triton server (golang)☆13Nov 16, 2022Updated 3 years ago
- Deploy a complete data stack in just a couple of minutes.☆15Mar 6, 2024Updated last year
- A suite of tools aimed at making Iceberg REST Catalogs more approachable☆26Updated this week
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆22May 30, 2022Updated 3 years ago
- Elastic Stack Data Pipeline 구축 실습☆19Nov 20, 2021Updated 4 years ago
- Object Mapping Framework for NoSQL databases☆22Aug 15, 2025Updated 6 months ago
- Apache Atlas client☆18Jan 10, 2026Updated last month
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Jul 13, 2022Updated 3 years ago
- 패스트캠퍼스, 파이썬을 이용한 머신러닝 입문 실습 코드☆21Sep 25, 2020Updated 5 years ago
- ☆23Nov 17, 2022Updated 3 years ago
- ☆23Jun 30, 2024Updated last year
- RisingWave Console is a simple tool for managing on-prem RisingWave clusters.☆29Updated this week
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da…☆28Jul 2, 2022Updated 3 years ago
- ☆25Mar 15, 2024Updated last year
- ☆25Jul 2, 2022Updated 3 years ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆30Feb 27, 2024Updated last year
- Spring Boot based Detecting SlowQuery & Query Statistics Example using DBCP2 Proxy☆26Jan 7, 2016Updated 10 years ago
- ☆11Oct 1, 2025Updated 4 months ago
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Jun 16, 2025Updated 7 months ago
- asw.cluster R package for calculating group faultlines☆12Aug 20, 2023Updated 2 years ago
- fine-tuning tutorial☆17Dec 13, 2025Updated 2 months ago
- Converters between JodaTime and Java 8 time classes, plus some legacy java.util time classes utility, for Java 8 and Scala☆13Oct 18, 2016Updated 9 years ago
- Less-Resilient MapReduce framework for Go☆36Jan 17, 2024Updated 2 years ago
- This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The fictici…☆14Sep 30, 2024Updated last year
- DOS Program Development☆12Nov 9, 2022Updated 3 years ago
- ☆32Mar 7, 2018Updated 7 years ago
- This project shows how to capture changes from postgres database and stream them into kafka☆41May 17, 2024Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆258Dec 13, 2025Updated 2 months ago
- Running JupyterHub on Kubernetes (AWS EKS) in 30 minutes☆35Oct 26, 2019Updated 6 years ago
- Data pipeline example written in Rust with Polars and DataFusion DataFrame package☆41Mar 12, 2023Updated 2 years ago
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year