aws-samples/aws-glue-streaming-ingestion-from-kafka-to-apache-iceberg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aws-samples/aws-glue-streaming-ingestion-from-kafka-to-apache-iceberg)

aws-samples / aws-glue-streaming-ingestion-from-kafka-to-apache-iceberg

This is a collecton of Amazon CDK projects to show how to directly ingest streaming data from Amazon Mananged Service for Apache Kafka (MSK) and MSK Serverless into Apache Iceberg table in S3 with AWS Glue Streaming.

☆16

Alternatives and similar repositories for aws-glue-streaming-ingestion-from-kafka-to-apache-iceberg

Users that are interested in aws-glue-streaming-ingestion-from-kafka-to-apache-iceberg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aws-samples / aws-glue-streaming-etl-with-apache-iceberg
View on GitHub
Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3
☆27Sep 10, 2024Updated last year
aws-samples / aws-account-migration-example
View on GitHub
☆17Jan 11, 2024Updated 2 years ago
aws-samples / emr-trino-autoscale
View on GitHub
☆23Feb 14, 2025Updated last year
aws-samples / retail-large-data-ml-e2e
View on GitHub
小売業で予測ベースの発注を実現するためのサンプルソリューション
☆17Apr 10, 2025Updated last year
aws-samples / amazon-redshift-streaming-workshop
View on GitHub
This repository provides the resources required for the Amazon Redshift Streaming workshop
☆13Apr 13, 2026Updated 3 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
aws-samples / bluegreen-to-amazon-ecs-using-aws-cdk-aws-codedeploy
View on GitHub
This is a sample app created to illustrate the best practices outlined in the following blog post
☆15Oct 6, 2023Updated 2 years ago
kaloureyes3 / v4-clients
View on GitHub
☆10Apr 5, 2024Updated 2 years ago
aws-samples / aws-genai-audio-text-chat-moderation
View on GitHub
☆11May 7, 2024Updated 2 years ago
aws-samples / aws-lakeformation-datasharing-workflow
View on GitHub
☆15Feb 12, 2026Updated 5 months ago
farhansadeed / Python-COVID-19-Trade-Impact-Data-Analysis
View on GitHub
This repository contains an analysis of the effects of COVID-19 on trade trends up to December 2021. The dataset used provides daily trad…
☆16Aug 16, 2023Updated 2 years ago
aws-samples / transactional-datalake-using-apache-iceberg-on-aws-glue
View on GitHub
Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS
☆36Updated this week
karlospn / building-qa-app-with-aws-bedrock-kendra-s3-and-streamlit
View on GitHub
Building a Q&A app (powered by a LLM model) using AWS Bedrock, AWS Kendra, AWS S3 and Streamlit in just a couple of hours
☆17Dec 7, 2023Updated 2 years ago
danilop / lambda-rust-and-cdk
View on GitHub
☆19Dec 23, 2022Updated 3 years ago
aws-samples / generative-ai-on-aws-architecture-patterns
View on GitHub
☆21Nov 11, 2023Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
dwijendra626 / DataAnalysis_and_Visualization_Projects
View on GitHub
This repository contains End-to-end Data Analytics Projects
☆17Jun 6, 2023Updated 3 years ago
ritik8801 / Data-Analysis-of-Bicycle-Manufacturing-Company-Using-Python-SQL-and-Power-BI
View on GitHub
Data Analysis of Bicycle Manufacturing Company Using Python, SQL and Power BI
☆14Apr 14, 2023Updated 3 years ago
sahilbhange / Facebook-Data-Extraction
View on GitHub
#DataPipeLine #ETL - Created is a Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Grap…
☆13Jun 27, 2018Updated 8 years ago
karlospn / building-qa-app-with-openai-pinecone-and-streamlit
View on GitHub
Building a GPT-4 Q&A app using Azure OpenAI, Pinecone and Streamlit in just a couple of hours
☆23Jul 6, 2023Updated 3 years ago
oreillymedia / Data_Analytics_with_Hadoop
View on GitHub
☆15Feb 20, 2018Updated 8 years ago
aws-samples / amazon-managed-service-for-apache-flink-examples
View on GitHub
Collection of code examples for Amazon Managed Service for Apache Flink
☆90Jun 16, 2026Updated last month
aws-samples / emr-spark-benchmark
View on GitHub
☆26Apr 26, 2026Updated 3 months ago
KhushiBhadange / Exploratory-Data-Analysis-Mental-Health-Problem
View on GitHub
In this repository, explore insightful solutions through exploratory data analysis focusing on mental health problems. Gain valuable insi…
☆26Dec 18, 2023Updated 2 years ago
devopsbox-io / aws-ecr-cleaner
View on GitHub
Removes unused images from AWS ECR
☆19Apr 7, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
aws-samples / amazon-bedrock-ai-karaoke
View on GitHub
Amazon Bedrock AI Karaoke is an interactive demonstration of Amazon Bedrock. Users complete the prompt with the microphone and choose the…
☆19Jan 29, 2025Updated last year
mrpowers-io / levi
View on GitHub
Delta Lake helper methods. No Spark dependency.
☆22Jan 19, 2026Updated 6 months ago
awslabs / aws-glue-streaming-libs
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
PacktPublishing / Fundamentals-of-Apache-Flink
View on GitHub
Fundamentals of Apache Flink [video], published by Packt
☆12Jan 30, 2023Updated 3 years ago
dbt-labs / dbtcloud-terraforming
View on GitHub
CLI tool to help importing existing dbt Cloud config to Terraform
☆31Jul 20, 2026Updated last week
in28minutes / hello-world-rest-api-aws-ecs-codepipeline
View on GitHub
☆14Jan 9, 2020Updated 6 years ago
aws-samples / spark-on-aws-lambda
View on GitHub
Spark runtime on AWS Lambda
☆113Aug 28, 2025Updated 11 months ago
40net-cloud / fortinet-aws-solutions
View on GitHub
☆20Dec 22, 2025Updated 7 months ago
aws-samples / monitoring-apache-iceberg-table-metadata-layer
View on GitHub
Sample code to collect Apache Iceberg metrics for table monitoring
☆29Aug 18, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
eon01 / LLMPromptEngineeringForDevelopersFiles
View on GitHub
This repository contains the code snippets used in "LLM Prompt Engineering For Developers"
☆14Apr 22, 2024Updated 2 years ago
aws-samples / amazon-rekognition-large-scale-processing
View on GitHub
☆12Sep 24, 2020Updated 5 years ago
aws-samples / aws-jp-iot-samples
View on GitHub
AWS IoT Services sample codes. these will be used in AWS IoT hands-on/workshops in Japan.
☆11Jan 14, 2021Updated 5 years ago
FareedKhan-dev / AI-outlier-detection
View on GitHub
Outlier Detection with AI + ML
☆15Sep 12, 2025Updated 10 months ago
awslabs / app-server-migration
View on GitHub
app-server-migration helps in discovering the changes required to migrate the code from source server to target server and provides effor…
☆16May 5, 2026Updated 2 months ago
ismaildawoodjee / aws-data-pipeline
View on GitHub
A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…
☆24May 14, 2022Updated 4 years ago
cjtrowbridge / YouTube-Backup
View on GitHub
A simple script for backing up your favorite YouTube channels.
☆12Jan 27, 2024Updated 2 years ago