jaehyeon-kim / kafka-pocs
Apache Kafka and Related Projects
☆28Updated 10 months ago
Alternatives and similar repositories for kafka-pocs:
Users that are interested in kafka-pocs are comparing it to the libraries listed below
- Sample code to collect Apache Iceberg metrics for table monitoring☆23Updated 5 months ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆65Updated 3 years ago
- Terraform modules for provisioning and managing AWS Glue resources☆28Updated this week
- Terraform module to support Sagemaker for AWS provider☆31Updated last year
- AWS Quick Start Team☆18Updated 3 months ago
- Repo that will help you explore how to build a hybrid workflow using Apache Airflow and Amazon ECS Anywhere☆10Updated 2 years ago
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati…☆110Updated 2 months ago
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆62Updated 3 weeks ago
- Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelines☆40Updated 2 years ago
- Terraform module to create AWS EMR resources 🇺🇦☆24Updated 2 weeks ago
- Demo for GitHub Universe 2022☆12Updated 2 years ago
- Apache Flink (Pyflink) and Related Projects☆29Updated 7 months ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆43Updated 2 years ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects☆42Updated last month
- dbt / Amazon Redshift Demonstration Project☆33Updated 2 years ago
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…☆103Updated last month
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated last year
- Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS☆73Updated last month
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆49Updated last year
- ☆41Updated this week
- ☆21Updated 4 months ago
- This repository provides the resources required for the Amazon Redshift Streaming workshop☆12Updated last year
- AWS serverless etl and streaming demo☆18Updated 3 years ago
- Data Pipeline for CDC data from MySQL DB to Amazon OpenSearch Service through Amazon Kinesis using Amazon Data Migration Service(DMS).☆29Updated last week
- ☆34Updated 2 years ago
- Build DataOps platform with Apache Airflow and dbt on AWS☆53Updated 3 years ago
- This is a collecton of Amazon CDK projects to show how to directly ingest streaming data from Amazon Mananged Service for Apache Kafka (M…☆11Updated 4 months ago
- This is a collecton of CDK projects to show how to load data from streaming services into Amazon Redshift.☆13Updated 4 months ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆20Updated 2 months ago