pmoskovi / flink-learning-resources
A curated list of Apache Flink learning resources
☆87 · Updated 7 months ago
Alternatives and similar repositories for flink-learning-resources
Users who are interested in flink-learning-resources are comparing it to the repositories listed below
- Tech blogs & talks by companies that run Apache Flink in production ☆173 · Updated last month
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture ☆92 · Updated 2 months ago
- A Python Library to support running data quality rules while the spark job is running ☆189 · Updated 2 weeks ago
- Code snippets for Data Engineering Design Patterns book ☆148 · Updated 5 months ago
- ☆267 · Updated 10 months ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC) ☆60 · Updated last year
- Delta Lake examples ☆226 · Updated 10 months ago
- ☆91 · Updated 7 months ago
- Weekly Data Engineering Newsletter ☆96 · Updated last year
- Drop-in replacement for Apache Spark UI ☆293 · Updated last week
- ☆59 · Updated last year
- A curated list of awesome blogs, videos, tools and resources about Data Contracts ☆178 · Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆70 · Updated 11 months ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin… ☆73 · Updated last year
- Apache Hive Metastore as a Standalone server in Docker ☆79 · Updated last year
- Simple stream processing pipeline ☆103 · Updated last year
- Spark style guide ☆262 · Updated 10 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆411 · Updated 3 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆44 · Updated 2 years ago
- Code snippets used in demos recorded for the blog. ☆37 · Updated 2 weeks ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆279 · Updated this week
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024. ☆26 · Updated 8 months ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. ☆164 · Updated 8 months ago
- Template for a data contract used in a data mesh. ☆474 · Updated last year
- Don't Panic. This guide will help you when it feels like the end of the world. ☆27 · Updated 2 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆95 · Updated last year
- Yet Another (Spark) ETL Framework ☆21 · Updated last year
- Delta Lake Documentation ☆49 · Updated last year
- Delta Lake helper methods in PySpark ☆325 · Updated 11 months ago
- A Table format agnostic data sharing framework ☆38 · Updated last year