factorhouse / factorhouse-local
Docker Compose environments for demonstrating modern data platform architectures using Kafka, Flink, Spark, Iceberg, Pinot + Kpow & Flex by Factor House
☆29 Updated 2 weeks ago
Alternatives and similar repositories for factorhouse-local
Users who are interested in factorhouse-local are comparing it to the repositories listed below
- Feature demos, integration guides & hands-on labs/projects using Kpow, Flex, Kafka, Flink, Iceberg & more ☆27 Updated this week
- ☆93 Updated 7 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture ☆92 Updated 2 months ago
- Code snippets for Data Engineering Design Patterns book ☆151 Updated 5 months ago
- Yet Another (Spark) ETL Framework ☆21 Updated last year
- 📚 Tech blogs & talks by companies that run Apache Flink in production ☆173 Updated last week
- The Open-Source Enterprise Data Platform in a single Portal ☆256 Updated this week
- Delta Lake Documentation ☆49 Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆70 Updated 3 weeks ago
- ☆59 Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring ☆28 Updated last year
- Delta Lake examples ☆227 Updated 11 months ago
- Quick Guides from Dremio on Several topics ☆74 Updated 2 weeks ago
- Edit your data contract in the Data Contract Editor ☆25 Updated 10 months ago
- Don't Panic. This guide will help you when it feels like the end of the world. ☆27 Updated 2 months ago
- A curated list of Apache Flink learning resources ☆88 Updated 8 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆220 Updated 4 months ago
- Terraform Provider for Airbyte API ☆59 Updated 2 months ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a… ☆32 Updated last year
- ☆80 Updated 10 months ago
- Spark runtime on AWS Lambda ☆109 Updated last week
- ☆80 Updated 4 months ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. ☆165 Updated 9 months ago
- Open source stack lakehouse ☆26 Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆95 Updated last year
- A Python Library to support running data quality rules while the spark job is running ⚡ ☆188 Updated this week
- In-Memory Analytics for Kafka using DuckDB ☆137 Updated this week
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR ☆67 Updated 3 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆64 Updated 3 years ago
- ☆23 Updated 4 years ago