pmoskovi / flink-learning-resources
A curated list of Apache Flink learning resources
β66Updated 4 months ago
Alternatives and similar repositories for flink-learning-resources
Users that are interested in flink-learning-resources are comparing it to the libraries listed below
Sorting:
- π Tech blogs & talks by companies that run Apache Flink in productionβ172Updated 3 months ago
- Code snippets used in demos recorded for the blog.β37Updated 2 weeks ago
- β53Updated 9 months ago
- Code snippets for Data Engineering Design Patterns bookβ106Updated last month
- β81Updated 4 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architectureβ72Updated 2 weeks ago
- Don't Panic. This guide will help you when it feels like the end of the world.β23Updated 11 months ago
- A Table format agnostic data sharing frameworkβ38Updated last year
- Apache Flink (Pyflink) and Related Projectsβ38Updated last month
- CLI tool to bulk migrate the tables from one catalog another without a data copyβ77Updated last month
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.β25Updated 4 months ago
- Delta Lake examplesβ224Updated 7 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flinkβ95Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Pythonβ44Updated 2 years ago
- β80Updated 3 weeks ago
- A Python Library to support running data quality rules while the spark job is runningβ‘β188Updated last week
- Spark style guideβ258Updated 7 months ago
- β265Updated 6 months ago
- trino monitoring with JMX metrics through Prometheus and Grafanaβ14Updated 9 months ago
- System Design, Solution Architecture, Data Systems Practiceβ46Updated 2 weeks ago
- A Python package to submit and manage Apache Spark applications on Kubernetes.β41Updated last month
- The official repository for the Rock the JVM Spark Optimization 2 courseβ39Updated last year
- A library that provides useful extensions to Apache Spark and PySpark.β223Updated last month
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)β56Updated last year
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trinoβ19Updated 2 years ago
- Flowchart for debugging Spark applicationsβ105Updated 7 months ago
- Multi-hop declarative data pipelinesβ115Updated this week
- Docker envinroment to stream data from Kafka to Iceberg tablesβ28Updated last year
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflowsβ43Updated 10 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflowβ215Updated this week