dttung2905 / flink-at-scale
Tech blogs & talks by companies that run Apache Flink in production
188 stars, updated last month
Alternatives and similar repositories for flink-at-scale
Users who are interested in flink-at-scale are comparing it to the repositories listed below.
- 65 stars, updated last year
- 269 stars, updated last year
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture (140 stars, updated last week)
- A curated list of Apache Flink learning resources (122 stars, updated last year)
- The Internals of Delta Lake (187 stars, updated 2 months ago)
- Drop-in replacement for Apache Spark UI (397 stars, updated this week)
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024. (27 stars, updated last year)
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… (132 stars, updated 3 weeks ago)
- Flowchart for debugging Spark applications (106 stars, updated last year)
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake (296 stars, updated this week)
- Spark style guide (272 stars, updated last year)
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. (65 stars, updated 2 years ago)
- Multi-hop declarative data pipelines (124 stars, updated last week)
- The Internals of Spark SQL (483 stars, updated last week)
- Code snippets used in demos recorded for the blog. (37 stars, updated 2 weeks ago)
- A simple Spark-powered ETL framework that just works (182 stars, updated 4 months ago)
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 (103 stars, updated 3 years ago)
- 81 stars, updated 9 months ago
- A library that provides useful extensions to Apache Spark and PySpark. (232 stars, updated last week)
- 109 stars, updated last year
- Avro SerDe for Apache Spark structured APIs. (240 stars, updated 7 months ago)
- A simplified, lightweight ETL Framework based on Apache Spark (588 stars, updated 2 years ago)
- A highly efficient daemon for streaming data from Kafka into Delta Lake (427 stars, updated 8 months ago)
- 201 stars, updated this week
- 241 stars, updated last week
- Apache Hive Metastore as a Standalone server in Docker (79 stars, updated last year)
- A curated list of awesome lakehouse frameworks, applications, etc. (40 stars, updated 2 months ago)
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary! (235 stars, updated last year)
- The Internals of Apache Kafka (57 stars, updated 2 years ago)
- A list of all awesome open-source contributions for the Apache Kafka project (108 stars, updated 2 years ago)