polyzos / stream-processing-with-apache-flink
☆52 · Updated 7 months ago
Alternatives and similar repositories for stream-processing-with-apache-flink:
Users who are interested in stream-processing-with-apache-flink compare it to the repositories listed below.
- Flowchart for debugging Spark applications · ☆105 · Updated 5 months ago
- Tech blogs & talks by companies that run Apache Flink in production · ☆167 · Updated last month
- Code snippets used in demos recorded for the blog. · ☆30 · Updated last month
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. · ☆63 · Updated last year
- The Internals of Spark on Kubernetes · ☆70 · Updated 2 years ago
- Magic to help Spark pipelines upgrade · ☆34 · Updated 5 months ago
- The official repository for the Rock the JVM Spark Optimization 2 course · ☆38 · Updated last year
- The official repository for the Rock the JVM Spark Optimization with Scala course · ☆57 · Updated last year
- Sample processing code using Spark 2.1+ and Scala · ☆51 · Updated 4 years ago
- Examples of Spark 3.0 · ☆47 · Updated 4 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino · ☆19 · Updated 2 years ago
- The Internals of Delta Lake · ☆183 · Updated 2 months ago
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024. · ☆24 · Updated 3 months ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer · ☆25 · Updated 2 months ago
- Presto Trino with Apache Hive Postgres metastore · ☆40 · Updated 6 months ago
- Spark on Kubernetes using Helm · ☆34 · Updated 4 years ago
- The Internals of PySpark · ☆26 · Updated 2 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 · ☆97 · Updated 2 years ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture · ☆59 · Updated 2 months ago
- Yet Another (Spark) ETL Framework · ☆20 · Updated last year
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis · ☆9 · Updated last year
- ☆71 · Updated 2 months ago
- Scalable CDC Pattern Implemented using PySpark · ☆18 · Updated 5 years ago
- This repository contains recipes for Apache Pinot. · ☆30 · Updated 3 weeks ago
- A library that brings useful functions from various modern database management systems to Apache Spark · ☆58 · Updated last year
- Data validation library for PySpark 3.0.0 · ☆33 · Updated 2 years ago
- Lab for testing different Flink job latency optimization techniques covered in a Flink Forward 2021 talk · ☆27 · Updated 3 years ago
- A simple Spark-powered ETL framework that just works · ☆181 · Updated 3 weeks ago
- Apache Flink Guide · ☆56 · Updated 3 years ago
- Apache Spark 3 - Structured Streaming Course Material · ☆45 · Updated 4 years ago