dttung2905 / flink-at-scale
Tech blogs & talks by companies that run Apache Flink in production
☆186 Updated 2 weeks ago
Alternatives and similar repositories for flink-at-scale
Users who are interested in flink-at-scale are comparing it to the repositories listed below.
- ☆63 Updated last year
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture ☆125 Updated last month
- ☆269 Updated last year
- A curated list of Apache Flink learning resources ☆115 Updated 11 months ago
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024. ☆27 Updated last year
- Flowchart for debugging Spark applications ☆107 Updated last year
- The Internals of Delta Lake ☆187 Updated 3 weeks ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆291 Updated 2 weeks ago
- Drop-in replacement for Apache Spark UI ☆370 Updated 3 weeks ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. ☆65 Updated 2 years ago
- Multi-hop declarative data pipelines ☆122 Updated last week
- Code snippets used in demos recorded for the blog. ☆37 Updated last week
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. ☆168 Updated 3 months ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆130 Updated last week
- ☆107 Updated 11 months ago
- A simple Spark-powered ETL framework that just works 🍺 ☆181 Updated 2 months ago
- A library that provides useful extensions to Apache Spark and PySpark. ☆231 Updated last week
- Don't Panic. This guide will help you when it feels like the end of the world. ☆30 Updated 3 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake ☆424 Updated 7 months ago
- A Python Library to support running data quality rules while the spark job is running ⚡ ☆193 Updated this week
- Spark style guide ☆266 Updated last year
- ☆81 Updated 8 months ago
- Apache Flink Guide ☆59 Updated 4 years ago
- In-Memory Analytics for Kafka using DuckDB ☆146 Updated last month
- Apache Flink Stateful Functions Playground ☆134 Updated 2 years ago
- Open Control Plane for Tables in Data Lakehouse ☆375 Updated this week
- The Internals of Apache Kafka ☆57 Updated 2 years ago
- A curated list of awesome lakehouse frameworks, applications, etc. ☆37 Updated 3 weeks ago
- Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide) ☆230 Updated 3 years ago
- Avro SerDe for Apache Spark structured APIs. ☆238 Updated 6 months ago