pmoskovi / flink-learning-resourcesLinks
A curated list of Apache Flink learning resources
β70Updated 4 months ago
Alternatives and similar repositories for flink-learning-resources
Users that are interested in flink-learning-resources are comparing it to the libraries listed below
Sorting:
- π Tech blogs & talks by companies that run Apache Flink in productionβ173Updated 4 months ago
- β57Updated 9 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Pythonβ44Updated 2 years ago
- β85Updated 4 months ago
- Code snippets used in demos recorded for the blog.β37Updated last month
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architectureβ74Updated last month
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testinβ¦β71Updated last year
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.β26Updated 5 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flinkβ95Updated last year
- Apache Flink (Pyflink) and Related Projectsβ39Updated last month
- Simple repo to demonstrate how to submit a spark job to EMR from Airflowβ33Updated 4 years ago
- Delta Lake examplesβ225Updated 7 months ago
- Simple stream processing pipelineβ103Updated 11 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbtβ140Updated 10 months ago
- β264Updated 7 months ago
- β12Updated last year
- Code snippets for Data Engineering Design Patterns bookβ116Updated 2 months ago
- A repository of sample code to accompany our blog post on Airflow and dbt.β173Updated last year
- A Python Library to support running data quality rules while the spark job is runningβ‘β188Updated this week
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)β58Updated last year
- Weekly Data Engineering Newsletterβ95Updated 10 months ago
- Presto Trino with Apache Hive Postgres metastoreβ41Updated 8 months ago
- Spark data pipeline that processes movie ratings data.β28Updated this week
- Collection of code examples for Amazon Managed Service for Apache Flinkβ58Updated this week
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.β161Updated 6 months ago
- Docker with Airflow and Spark standalone clusterβ256Updated last year
- Spark style guideβ259Updated 8 months ago
- β44Updated 3 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflowβ70Updated 8 months ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMRβ66Updated 3 years ago