pmoskovi / flink-learning-resourcesLinks
A curated list of Apache Flink learning resources
β119Updated last year
Alternatives and similar repositories for flink-learning-resources
Users that are interested in flink-learning-resources are comparing it to the libraries listed below
Sorting:
- π Tech blogs & talks by companies that run Apache Flink in productionβ187Updated last month
- β107Updated 11 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architectureβ126Updated 2 months ago
- A Python Library to support running data quality rules while the spark job is runningβ‘β193Updated last week
- β268Updated last year
- Simple stream processing pipelineβ110Updated last year
- Delta Lake examplesβ236Updated last year
- Code snippets for Data Engineering Design Patterns bookβ307Updated last week
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Pythonβ44Updated 3 years ago
- Adapter for dbt that executes dbt pipelines on Apache Flinkβ97Updated last year
- β64Updated last year
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)β64Updated 2 years ago
- Spark style guideβ271Updated last year
- Don't Panic. This guide will help you when it feels like the end of the world.β30Updated 4 months ago
- Code snippets used in demos recorded for the blog.β37Updated last month
- A Python package to submit and manage Apache Spark applications on Kubernetes.β46Updated 5 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflowβ80Updated 3 weeks ago
- Materials of the Official Helm Chart Webinarβ27Updated 4 years ago
- Weekly Data Engineering Newsletterβ96Updated last year
- Apache Hive Metastore as a Standalone server in Dockerβ80Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.β169Updated 3 months ago
- Spark on Kubernetes using Helmβ33Updated 5 years ago
- Apache Flink Demo Projectsβ44Updated last month
- A curated list of open source tools used in analytics platforms and data engineering ecosystemβ422Updated 10 months ago
- β81Updated 8 months ago
- CLI tool to bulk migrate the tables from one catalog another without a data copyβ83Updated 8 months ago
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.β91Updated last year
- Delta Lake Documentationβ51Updated last year
- System Design, Solution Architecture, Data Systems Practiceβ66Updated 4 months ago
- The official repository for the Rock the JVM Spark Optimization 2 courseβ42Updated 2 years ago