factorhouse / examples
Feature demos, integration guides & hands-on labs/projects using Kpow, Flex, Kafka, Flink, Iceberg & more
☆50Updated this week
Alternatives and similar repositories for examples
Users who are interested in examples are comparing it to the repositories listed below
- Docker Compose environments for developing modern data platform architectures using Kafka, Flink, Spark, Iceberg, OpenLineage, OpenMetada…☆51Updated last week
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆140Updated 2 weeks ago
- ☆65Updated last year
- ☆110Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 3 years ago
- Amazon EMR Serverless and Amazon MSK Serverless Demo☆13Updated 3 years ago
- Code for dbt tutorial☆167Updated 4 months ago
- Cost Efficient Data Pipelines with DuckDB☆61Updated 8 months ago
- Code snippets for Data Engineering Design Patterns book☆324Updated last month
- 📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.☆59Updated last year
- The Open-Source Enterprise Data Platform in a single Portal☆264Updated this week
- ☆80Updated last year
- Delta Lake Documentation☆53Updated last year
- Yet Another (Spark) ETL Framework☆21Updated 2 years ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka☆40Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Updated last year
- Delta Lake examples☆238Updated last year
- Quick Guides from Dremio on Several topics☆81Updated 2 months ago
- Building Big Data Pipelines with Apache Beam, published by Packt☆89Updated 2 years ago
- ☆79Updated last week
- Execution of DBT models using Apache Airflow through Docker Compose☆126Updated 3 years ago
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 3 years ago
- Simple stream processing pipeline☆110Updated last year
- New generation open-source data stack☆76Updated 3 years ago
- Build & Learn Data Engineering, Machine Learning over Kubernetes. No Shortcut approach.☆57Updated 3 years ago
- ☆59Updated last year
- Data engineering with dbt, published by Packt☆89Updated 5 months ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- ☆214Updated last year
- Resources for video demonstrations and blog posts related to DataOps on AWS☆183Updated 4 years ago