jaehyeon-kim / flink-demos
Apache Flink (Pyflink) and Related Projects
☆30Updated 8 months ago
Alternatives and similar repositories for flink-demos:
Users that are interested in flink-demos are comparing it to the libraries listed below
- ☆47Updated 6 months ago
- ☆53Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink☆90Updated 11 months ago
- ☆24Updated 6 months ago
- ☆18Updated last week
- Sample code to collect Apache Iceberg metrics for table monitoring☆24Updated 6 months ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆19Updated 2 years ago
- ☆40Updated last year
- The Internals of Spark on Kubernetes☆70Updated 2 years ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆25Updated 11 months ago
- ☆25Updated 11 months ago
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆36Updated 2 weeks ago
- Presto Trino with Apache Hive Postgres metastore☆39Updated 5 months ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated last year
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆65Updated 3 years ago
- ☆12Updated 2 months ago
- ☆53Updated this week
- ☆258Updated 3 months ago
- ☆15Updated last year
- A Python package to submit and manage Apache Spark applications on Kubernetes.☆41Updated this week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆97Updated last week
- Code snippets for Data Engineering Design Patterns book☆69Updated 2 weeks ago
- Spark runtime on AWS Lambda☆105Updated 5 months ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆72Updated 3 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated 2 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆62Updated last year
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆28Updated last month
- A Table format agnostic data sharing framework☆38Updated last year
- Utility functions for dbt projects running on Trino☆21Updated last year
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆163Updated 3 weeks ago