factorhouse / examplesLinks
Feature demos, integration guides & hands-on labs/projects using Kpow, Flex, Kafka, Flink, Iceberg & more
☆40Updated last month
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below
Sorting:
- Docker Compose environments for demonstrating modern data platform architectures using Kafka, Flink, Spark, Iceberg, Pinot + Kpow & Flex …☆46Updated last month
- Code for dbt tutorial☆165Updated 2 months ago
- Code snippets for Data Engineering Design Patterns book☆271Updated 8 months ago
- ☆104Updated 10 months ago
- ☆80Updated last year
- New generation opensource data stack☆75Updated 3 years ago
- Cloned by the `dbt init` task☆62Updated last year
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆123Updated last week
- ☆62Updated last year
- Execution of DBT models using Apache Airflow through Docker Compose☆124Updated 2 years ago
- Amazon EMR Serverless and Amazon MSK Serverless Demo☆13Updated 3 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆79Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆60Updated 6 months ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- build dw with dbt☆49Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- DataOps Observability is part of DataKitchen's Open Source Data Observability. DataOps Observability monitors every data journey from da…☆50Updated 2 weeks ago
- Data engineering with dbt, published by Packt☆87Updated 2 months ago
- Simple stream processing pipeline☆110Updated last year
- Spark data pipeline that processes movie ratings data.☆30Updated 2 weeks ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- ☆30Updated last year
- Generate descriptions of Snowflake tables and views with LLMs☆26Updated 5 months ago
- 📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.☆52Updated 10 months ago
- Sample project to demonstrate data engineering best practices☆198Updated last year
- The Open-Source Enterprise Data Platform in a single Portal☆260Updated this week
- Resources for video demonstrations and blog posts related to DataOps on AWS☆182Updated 3 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Updated last year
- Code for my "Efficient Data Processing in SQL" book.☆60Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆46Updated last year