pmoskovi / flink-learning-resourcesLinks
A curated list of Apache Flink learning resources
β106Updated 10 months ago
Alternatives and similar repositories for flink-learning-resources
Users that are interested in flink-learning-resources are comparing it to the libraries listed below
Sorting:
- π Tech blogs & talks by companies that run Apache Flink in productionβ183Updated last week
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architectureβ123Updated last week
- β269Updated last year
- β104Updated 10 months ago
- A Python Library to support running data quality rules while the spark job is runningβ‘β191Updated last week
- Adapter for dbt that executes dbt pipelines on Apache Flinkβ96Updated last year
- Delta Lake examplesβ231Updated last year
- Drop-in replacement for Apache Spark UIβ347Updated 3 weeks ago
- Code snippets for Data Engineering Design Patterns bookβ271Updated 8 months ago
- CLI tool to bulk migrate the tables from one catalog another without a data copyβ83Updated 7 months ago
- β62Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Pythonβ44Updated 2 years ago
- Simple stream processing pipelineβ110Updated last year
- A Table format agnostic data sharing frameworkβ42Updated last year
- Code snippets used in demos recorded for the blog.β37Updated 2 weeks ago
- Spark style guideβ264Updated last year
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lakeβ292Updated this week
- Apache Hive Metastore as a Standalone server in Dockerβ80Updated last year
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)β63Updated 2 years ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)β253Updated 2 months ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.β167Updated 2 months ago
- A curated list of awesome blogs, videos, tools and resources about Data Contractsβ180Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflowβ74Updated last week
- Open source stack lakehouseβ25Updated last year
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.β26Updated 10 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lakeβ417Updated 6 months ago
- Flowchart for debugging Spark applicationsβ107Updated last year
- Weekly Data Engineering Newsletterβ96Updated last year
- The Internals of Delta Lakeβ186Updated 10 months ago
- Don't Panic. This guide will help you when it feels like the end of the world.β29Updated 2 months ago