dttung2905 / flink-at-scaleLinks
๐ Tech blogs & talks by companies that run Apache Flink in production
โ172Updated 4 months ago
Alternatives and similar repositories for flink-at-scale
Users that are interested in flink-at-scale are comparing it to the libraries listed below
Sorting:
- โ57Updated 9 months ago
- Code snippets used in demos recorded for the blog.โ37Updated last month
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.โ25Updated 5 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architectureโ74Updated last month
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Aโฆโ125Updated last week
- Flowchart for debugging Spark applicationsโ105Updated 8 months ago
- Multi-hop declarative data pipelinesโ115Updated this week
- The Internals of Delta Lakeโ184Updated 4 months ago
- โ264Updated 7 months ago
- Examples for using Apache Flinkยฎ with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.โ64Updated last year
- A list of all awesome open-source contributions for the Apache Kafka projectโ103Updated last year
- A highly efficient daemon for streaming data from Kafka into Delta Lakeโ403Updated 3 weeks ago
- A Python Library to support running data quality rules while the spark job is runningโกโ188Updated this week
- A library that provides useful extensions to Apache Spark and PySpark.โ224Updated 2 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0โ98Updated 2 years ago
- Kafka Streams demo project containing Derivative Events, the Processor Api and Wall-clock examplesโ26Updated 4 years ago
- In-Memory Analytics for Kafka using DuckDBโ122Updated last week
- Apache Hive Metastore as a Standalone server in Dockerโ75Updated 9 months ago
- CLI tool to bulk migrate the tables from one catalog another without a data copyโ77Updated last month
- Apache Flink Training Excercisesโ124Updated last week
- โ84Updated 4 months ago
- โ80Updated last month
- Avro SerDe for Apache Spark structured APIs.โ235Updated 10 months ago
- Spark style guideโ259Updated 8 months ago
- The Internals of Spark on Kubernetesโ71Updated 3 years ago
- Adapter for dbt that executes dbt pipelines on Apache Flinkโ95Updated last year
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lakeโ259Updated last week
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.โ161Updated 6 months ago
- A simple Spark-powered ETL framework that just works ๐บโ181Updated 3 weeks ago
- โ204Updated this week