dttung2905 / flink-at-scale
๐ Tech blogs & talks by companies that run Apache Flink in production
โ171Updated 3 months ago
Alternatives and similar repositories for flink-at-scale:
Users that are interested in flink-at-scale are comparing it to the libraries listed below
- โ53Updated 8 months ago
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.โ25Updated 4 months ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Aโฆโ121Updated last month
- The Internals of Delta Lakeโ183Updated 3 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architectureโ67Updated this week
- Flowchart for debugging Spark applicationsโ105Updated 7 months ago
- Code snippets used in demos recorded for the blog.โ35Updated last week
- Multi-hop declarative data pipelinesโ115Updated this week
- A curated list of Apache Flink learning resourcesโ64Updated 3 months ago
- โ265Updated 6 months ago
- Examples for using Apache Flinkยฎ with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.โ64Updated last year
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lakeโ251Updated this week
- CLI tool to bulk migrate the tables from one catalog another without a data copyโ77Updated 3 weeks ago
- A simple Spark-powered ETL framework that just works ๐บโ181Updated last month
- Apache flinkโ67Updated 2 weeks ago
- Storage connector for Trinoโ110Updated this week
- โ80Updated last week
- Spark style guideโ258Updated 7 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lakeโ397Updated 2 weeks ago
- โ193Updated last week
- โ43Updated 3 years ago
- Apache Flink Guideโ57Updated 3 years ago
- A Python Library to support running data quality rules while the spark job is runningโกโ186Updated this week
- In-Memory Analytics for Kafka using DuckDBโ118Updated last week
- A library that provides useful extensions to Apache Spark and PySpark.โ223Updated last month
- Avro SerDe for Apache Spark structured APIs.โ234Updated 9 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flinkโ95Updated last year
- โ80Updated 3 months ago
- A repository containing materials for Stateful Functions workshopโ44Updated last year
- The Internals of Apache Kafkaโ54Updated last year