dttung2905 / flink-at-scale
Tech blogs & talks by companies that run Apache Flink in production
☆162 Updated this week
Alternatives and similar repositories for flink-at-scale:
Users who are interested in flink-at-scale are comparing it to the repositories listed below
- ☆47 Updated 5 months ago
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024. ☆19 Updated last month
- Multi-hop declarative data pipelines ☆107 Updated this week
- Flowchart for debugging Spark applications ☆104 Updated 4 months ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆118 Updated last week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. ☆61 Updated last year
- The Internals of Delta Lake ☆183 Updated 2 weeks ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture ☆50 Updated 2 weeks ago
- A library that provides useful extensions to Apache Spark and PySpark. ☆207 Updated last month
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆73 Updated this week
- Code snippets used in demos recorded for the blog. ☆29 Updated 2 weeks ago
- ☆257 Updated 3 months ago
- Spark style guide ☆257 Updated 3 months ago
- A simple Spark-powered ETL framework that just works ☆178 Updated last year
- Avro SerDe for Apache Spark structured APIs. ☆231 Updated 6 months ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆219 Updated this week
- The Internals of Spark on Kubernetes ☆70 Updated 2 years ago
- Spark on Kubernetes using Helm ☆34 Updated 4 years ago
- A Python Library to support running data quality rules while the spark job is running ⚡ ☆168 Updated this week
- ☆63 Updated 2 weeks ago
- Examples of Spark 3.0 ☆47 Updated 4 years ago
- ☆41 Updated 2 years ago
- ☆63 Updated 5 years ago
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆90 Updated 10 months ago
- ☆79 Updated last year
- Apache Flink Training Exercises ☆121 Updated 3 months ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer ☆25 Updated 3 weeks ago
- A list of all awesome open-source contributions for the Apache Kafka project ☆98 Updated last year
- The Internals of Apache Kafka ☆51 Updated last year
- Lab for testing different Flink job latency optimization techniques covered in a Flink Forward 2021 talk ☆27 Updated 3 years ago