factorhouse / factorhouse-localLinks
Docker Compose environments for demonstrating modern data platform architectures using Kafka, Flink, Spark, Iceberg, Pinot + Kpow & Flex by Factor House
☆42Updated 2 weeks ago
Alternatives and similar repositories for factorhouse-local
Users that are interested in factorhouse-local are comparing it to the libraries listed below
Sorting:
- Feature demos, integration guides & hands-on labs/projects using Kpow, Flex, Kafka, Flink, Iceberg & more☆39Updated this week
- ☆60Updated last year
- ☆99Updated 8 months ago
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆176Updated last month
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated 2 years ago
- In-Memory Analytics for Kafka using DuckDB☆138Updated last week
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆84Updated 3 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated last year
- Open source stack lakehouse☆25Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆165Updated 3 weeks ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆32Updated last year
- ☆80Updated 5 months ago
- Command-line interface to quickly generate fake CSV and JSON data☆76Updated last year
- Multi-hop declarative data pipelines☆120Updated 2 weeks ago
- Code snippets for Data Engineering Design Patterns book☆207Updated 6 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆108Updated 3 months ago
- The Open-Source Enterprise Data Platform in a single Portal☆260Updated this week
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 3 years ago
- Apache Flink (Pyflink) and Related Projects☆41Updated 5 months ago
- A list of all awesome open-source contributions for the Apache Kafka project☆106Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated 2 weeks ago
- Spark runtime on AWS Lambda☆110Updated last month
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 3 years ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- ☆34Updated last week
- ☆39Updated 5 months ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆30Updated last year
- Collection of code examples for Amazon Managed Service for Apache Flink