jaehyeon-kim / flink-demos
Apache Flink (Pyflink) and Related Projects
ā34Updated 9 months ago
Alternatives and similar repositories for flink-demos:
Users that are interested in flink-demos are comparing it to the libraries listed below
- ā54Updated last year
- ā51Updated 7 months ago
- š Tech blogs & talks by companies that run Apache Flink in productionā166Updated last month
- Sample code to collect Apache Iceberg metrics for table monitoringā24Updated 6 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0ā97Updated 2 years ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize aā¦ā22Updated 10 months ago
- Spark on Kubernetes using Helmā34Updated 4 years ago
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake workā48Updated 2 years ago
- Presto Trino with Apache Hive Postgres metastoreā40Updated 6 months ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formatsā29Updated last year
- The Internals of Spark on Kubernetesā70Updated 2 years ago
- ā25Updated 11 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Pythonā42Updated 2 years ago
- Adapter for dbt that executes dbt pipelines on Apache Flinkā91Updated 11 months ago
- Collection of code examples for Amazon Managed Service for Apache Flinkā48Updated 3 weeks ago
- ā71Updated last month
- Build Data Lake using Open Source toolsā91Updated 4 months ago
- Spark ETL example processing New York taxi rides public dataset on EKSā44Updated 2 years ago
- Code snippets for Data Engineering Design Patterns bookā74Updated last month
- A Table format agnostic data sharing frameworkā38Updated last year
- Yet Another (Spark) ETL Frameworkā20Updated last year
- ā19Updated last month
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL databaseā72Updated 3 years ago
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,ā¦ā28Updated last month
- ā49Updated last week
- Spark history server Helm Chartā20Updated 11 months ago
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational eā¦ā103Updated 3 months ago
- dbt-starrocks contains all of the code enabling dbt to work with StarRocksā25Updated this week
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.ā13Updated last year
- Docker envinroment to stream data from Kafka to Iceberg tablesā25Updated last year