dttung2905 / flink-at-scaleLinks
๐ Tech blogs & talks by companies that run Apache Flink in production
โ172Updated 2 weeks ago
Alternatives and similar repositories for flink-at-scale
Users that are interested in flink-at-scale are comparing it to the libraries listed below
Sorting:
- โ58Updated 11 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architectureโ89Updated 3 weeks ago
- A curated list of Apache Flink learning resourcesโ78Updated 6 months ago
- โ266Updated 8 months ago
- Code snippets used in demos recorded for the blog.โ37Updated last month
- Flowchart for debugging Spark applicationsโ105Updated 9 months ago
- The Internals of Delta Lakeโ184Updated 6 months ago
- Examples for using Apache Flinkยฎ with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.โ64Updated last year
- Multi-hop declarative data pipelinesโ117Updated last month
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.โ26Updated 6 months ago
- โ90Updated 5 months ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Aโฆโ125Updated last month
- Drop-in replacement for Apache Spark UIโ273Updated last week
- A simple Spark-powered ETL framework that just works ๐บโ181Updated 2 weeks ago
- A Python Library to support running data quality rules while the spark job is runningโกโ188Updated this week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lakeโ271Updated this week
- โ80Updated 2 months ago
- Spark style guideโ258Updated 9 months ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.โ163Updated 7 months ago
- A library that provides useful extensions to Apache Spark and PySpark.โ227Updated this week
- CLI tool to bulk migrate the tables from one catalog another without a data copyโ79Updated 3 months ago
- A list of all awesome open-source contributions for the Apache Kafka projectโ104Updated 2 years ago
- Delta Lake examplesโ226Updated 9 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lakeโ408Updated 2 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flinkโ95Updated last year
- The Internals of Spark on Kubernetesโ71Updated 3 years ago
- Apache Flink Stateful Functions Playgroundโ130Updated last year
- A tool to validate data, built around Apache Spark.โ101Updated last week
- The official repository for the Rock the JVM Spark Optimization 2 courseโ40Updated last year
- A simplified, lightweight ETL Framework based on Apache Sparkโ587Updated last year