aws-samples/iceberg-streaming-examples

This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenarios using best practices. The code can be deployed into any Spark compatible engine like Amazon EMR Serverless or AWS Glue. A fully local developer environment is also provided.

☆28

Alternatives and similar repositories for iceberg-streaming-examples

Users interested in iceberg-streaming-examples are also comparing it to the repositories listed below.

aws-samples / apache-xtable-on-aws-samples
☆11 · Jun 8, 2026 · Updated last month
aws-samples / monitoring-apache-iceberg-table-metadata-layer
Sample code to collect Apache Iceberg metrics for table monitoring
☆29 · Aug 18, 2024 · Updated last year
aws-samples / aws-saas-genai-rag-workshop
☆19 · Nov 18, 2025 · Updated 8 months ago
aws-samples / access-aws-services-programmatically-using-tip
A Python CLI application that demonstrates how you can access AWS services, such as Amazon S3 and Amazon Athena, using trusted identity p…
☆13 · Mar 11, 2025 · Updated last year
aws-samples / aws-emr-serverless-using-terraform
☆15 · Dec 19, 2025 · Updated 7 months ago
IvanWoo / trino-on-kubernetes
☆10 · May 5, 2022 · Updated 4 years ago
aws-samples / transactional-datalake-using-amazon-datafirehose-iceberg
Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with Amazon Data Firehose and DMS
☆18 · Feb 15, 2025 · Updated last year
aws-samples / sample-amazon-ecs-blue-green-deployment-patterns
Sample Patterns for Amazon ECS blue/green deployments
☆15 · Jul 8, 2026 · Updated 2 weeks ago
aws-samples / rgcn-fraud-detector
RGCN model for real-time fraud detection
☆10 · Jan 27, 2023 · Updated 3 years ago
soumilshah1995 / duckdb-etl-framework
duckdb-etl-framework
☆14 · Dec 20, 2024 · Updated last year
bartosz25 / data-ai-summit-2024
Visits sessionization pipeline used for the talk
☆13 · May 28, 2024 · Updated 2 years ago
reisdebora / awesome-databricks
A curated list of awesome Databricks resources, including Spark
☆22 · Jun 28, 2024 · Updated 2 years ago
aws-samples / emr-on-eks-benchmark
☆32 · Jul 2, 2026 · Updated 3 weeks ago
scravy / pysparkextra
☆10 · Jun 29, 2021 · Updated 5 years ago
T3los / mRemoteNGOpenVPN
☆16 · Aug 12, 2025 · Updated 11 months ago
himewel / covid19retail
Lambda data architecture reference project — dbt medallion models, Airflow DAGs, Superset dashboards, and Marquez lineage for Iowa retail…
☆14 · Jul 12, 2026 · Updated last week
dacort / ci-cd-serverless-spark
Demo for GitHub Universe 2022
☆13 · Jan 31, 2023 · Updated 3 years ago
marcincuber / kubernetes-fluxv2
Kubernetes- helm deployments using Flux V2
☆12 · Apr 29, 2024 · Updated 2 years ago
dockersamples / go-prometheus-monitoring
A Golang application that demonstrates how to monitor a Golang service using Prometheus and Grafana. This is for Docker's official Deno L…
☆15 · Mar 22, 2025 · Updated last year
robertgv / aws-community-builders-dashboard
Welcome to the AWS Community Builders Dashboard repository!
☆17 · Aug 29, 2025 · Updated 10 months ago
tiagotxm / yt-spark-no-kubernetes
☆13 · Feb 19, 2025 · Updated last year
aws-samples / amazon-sagemaker-mlops-template-with-aws-lambda-deployment
☆10 · Nov 2, 2023 · Updated 2 years ago
GeorgeHahn / readwise-epub
Create EPUBs from your Readwise Reader inbox
☆16 · Jul 8, 2023 · Updated 3 years ago
aws-samples / amazon-appflow-custom-jdbc-connector
☆21 · Dec 16, 2024 · Updated last year
cnoe-io / reference-implementation-aws
This is the reference implementation of CNOE and its toolings on AWS
☆126 · Jul 29, 2025 · Updated 11 months ago
rwxd / pyreadwise
Python Module to use the Readwise API
☆21 · Jan 24, 2026 · Updated 5 months ago
aws-samples / aws-building-data-lake-reinvent-session-stg206
Collection of Cloud Formation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a ReInvent Lab Exercis…
☆26 · Apr 9, 2019 · Updated 7 years ago
aws-solutions / verifiable-controls-evidence-store
This repository contains the source code of the Verifiable Controls Evidence Store solution
☆19 · Feb 19, 2025 · Updated last year
dipankarmazumdar / awesome-lakehouse-guide
Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture
☆177 · May 22, 2026 · Updated 2 months ago
apalrd / riscv-morse
Sample of a morse-code 'blinky' in different architectures
☆18 · Oct 9, 2023 · Updated 2 years ago
aws-samples / patient-matching-of-clinical-trials-using-generative-ai
☆21 · May 29, 2025 · Updated last year
xdanny / pyspark_types
Map your python dataclasses to pyspark types
☆10 · Feb 11, 2024 · Updated 2 years ago
repomirrorhq / open-dedalus
☆16 · Aug 23, 2025 · Updated 10 months ago
cloud-copilot / iam-shrink
Make AWS IAM policies smaller by adding wildcards to actions.
☆19 · Updated this week
scrapedia / scrapy-pipelines
A collection of pipelines for Scrapy
☆16 · Apr 27, 2026 · Updated 2 months ago
aws-samples / aws-emr-advisor
EMR Advisor uses Spark Event Logs to generate insights and costs/runtime recommendations using different deployment options for Amazon EM…
☆16 · Jun 5, 2025 · Updated last year
DragonPomelo / trino-opa-example
☆25 · Feb 7, 2024 · Updated 2 years ago
aws-samples / aws-streaming-generative-ai-application
Streaming Generative AI Application on AWS
☆14 · Jun 24, 2024 · Updated 2 years ago
klescosia / aws-glue-delta-lake
This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon …
☆18 · Aug 25, 2021 · Updated 4 years ago