dttung2905 / flink-at-scale
๐ Tech blogs & talks by companies that run Apache Flink in production
โ151Updated last month
Related projects: โ
- โ23Updated 2 months ago
- Multi-hop declarative data pipelinesโ86Updated last month
- โ129Updated this week
- โ197Updated last month
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Aโฆโ111Updated last month
- โ248Updated last week
- The Internals of Delta Lakeโ180Updated last month
- โ37Updated last month
- Flowchart for debugging Spark applicationsโ100Updated this week
- โ232Updated this week
- โ77Updated last year
- Examples for using Apache Flinkยฎ with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.โ55Updated 11 months ago
- Open Control Plane for Tables in Data Lakehouseโ289Updated this week
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)โ179Updated this week
- Simple project to expose a catalog over REST using a Java catalog backendโ102Updated this week
- A Python Library to support running data quality rules while the spark job is runningโกโ161Updated last month
- A simple Spark-powered ETL framework that just works ๐บโ177Updated 9 months ago
- The Internals of Spark on Kubernetesโ71Updated 2 years ago
- Code snippets used in demos recorded for the blog.โ28Updated 5 months ago
- Apache Hive Metastore as a Standalone server in Dockerโ64Updated 3 weeks ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.โ141Updated 2 weeks ago
- A highly efficient daemon for streaming data from Kafka into Delta Lakeโ354Updated last week
- A library that provides useful extensions to Apache Spark and PySpark.โ193Updated this week
- Adapter for dbt that executes dbt pipelines on Apache Flinkโ80Updated 6 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0โ96Updated last year
- Spark style guideโ255Updated last year
- A list of all awesome open-source contributions for the Apache Kafka projectโ95Updated last year
- โ144Updated this week
- Apache Flink Stateful Functions Playgroundโ127Updated 11 months ago
- CLI tool to bulk migrate the tables from one catalog another without a data copyโ51Updated this week