pmoskovi / flink-learning-resources
A curated list of Apache Flink learning resources
β56Updated 3 months ago
Alternatives and similar repositories for flink-learning-resources:
Users that are interested in flink-learning-resources are comparing it to the libraries listed below
- π Tech blogs & talks by companies that run Apache Flink in productionβ169Updated 3 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flinkβ94Updated last year
- β75Updated 3 months ago
- Sample code to collect Apache Iceberg metrics for table monitoringβ26Updated 8 months ago
- β53Updated 8 months ago
- Code snippets for Data Engineering Design Patterns bookβ80Updated last month
- Apache Hive Metastore as a Standalone server in Dockerβ73Updated 8 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architectureβ60Updated 3 months ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)β56Updated last year
- A repository of sample code to accompany our blog post on Airflow and dbt.β171Updated last year
- Library to convert DBT manifest metadata to Airflow tasksβ48Updated last year
- A Python Library to support running data quality rules while the spark job is runningβ‘β183Updated last week
- dbt + Trino demo project, using TPC-H sample dataβ19Updated last year
- Yet Another (Spark) ETL Frameworkβ20Updated last year
- β79Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Pythonβ43Updated 2 years ago
- Code snippets used in demos recorded for the blog.β33Updated this week
- β263Updated 6 months ago
- Multi-hop declarative data pipelinesβ112Updated last week
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL databaseβ74Updated 3 years ago
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.β24Updated 4 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0β98Updated 2 years ago
- Utility functions for dbt projects running on Trinoβ21Updated last year
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)β232Updated 3 weeks ago
- Replicates any database (CDC events) to Bigquery in real timeβ21Updated this week
- Presto Trino with Apache Hive Postgres metastoreβ41Updated 7 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbtβ137Updated 9 months ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Sparkβ¦β111Updated 2 months ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testinβ¦β67Updated last year
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lakeβ246Updated this week