factorhouse / examplesLinks
Feature demos, integration guides & hands-on labs/projects using Kpow, Flex, Kafka, Flink, Iceberg & more
☆27Updated this week
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below
Sorting:
- Docker Compose environments for demonstrating modern data platform architectures using Kafka, Flink, Spark, Iceberg, Pinot + Kpow & Flex …☆29Updated 2 weeks ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- Code snippets for Data Engineering Design Patterns book☆151Updated 5 months ago
- The Open-Source Enterprise Data Platform in a single Portal☆256Updated this week
- ☆93Updated 7 months ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 3 years ago
- ☆206Updated 7 months ago
- 📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.☆49Updated 7 months ago
- New generation opensource data stack☆72Updated 3 years ago
- This project shows how to capture changes from postgres database and stream them into kafka☆38Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated last year
- Delta Lake Documentation☆49Updated last year
- ☆80Updated 10 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆92Updated 2 months ago
- Delta Lake examples☆227Updated 11 months ago
- Cloud Dataproc: Samples and Utils☆11Updated 4 years ago
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆173Updated last week
- Yet Another (Spark) ETL Framework☆21Updated last year
- Resources for video demonstrations and blog posts related to DataOps on AWS☆182Updated 3 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆45Updated 9 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆40Updated last year
- Full stack data engineering tools and infrastructure set-up☆56Updated 4 years ago
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆179Updated last year
- Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.☆57Updated 2 years ago
- ☆59Updated last year
- Local Environment to Practice Data Engineering☆143Updated 8 months ago
- Amazon EMR Serverless and Amazon MSK Serverless Demo☆13Updated 3 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆118Updated 2 years ago
- Cloned by the `dbt init` task☆61Updated last year
- build dw with dbt☆47Updated 10 months ago