This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenarios using best practices. The code can be deployed into any Spark compatible engine like Amazon EMR Serverless or AWS Glue. A fully local developer environment is also provided.
☆27Mar 17, 2026Updated 2 months ago
Alternatives and similar repositories for iceberg-streaming-examples
Users that are interested in iceberg-streaming-examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Aug 18, 2024Updated last year
- ☆19Nov 18, 2025Updated 6 months ago
- A Python CLI application that demonstrates how you can access AWS services, such as Amazon S3 and Amazon Athena, using trusted identity p…☆13Mar 11, 2025Updated last year
- ☆15Dec 19, 2025Updated 5 months ago
- ☆10May 5, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- RGCN model for real-time fraud detection☆11Jan 27, 2023Updated 3 years ago
- duckdb-etl-framework☆15Dec 20, 2024Updated last year
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- Automate rule management for AWS Network Firewall☆17May 11, 2026Updated last week
- ☆10Jun 29, 2021Updated 4 years ago
- ☆11Oct 19, 2023Updated 2 years ago
- ☆32Jan 30, 2026Updated 3 months ago
- Demo for GitHub Universe 2022☆13Jan 31, 2023Updated 3 years ago
- ☆10Nov 2, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- NIFTY50 Data Analysis from scratch (Data Extraction & Visualization to Investment Insights)☆16May 20, 2023Updated 3 years ago
- CCCS security control profiles expressed using OSCAL☆21Oct 6, 2025Updated 7 months ago
- Change Data Capture (CDC) from PostgreSQL to ClickHouse☆16Jul 15, 2024Updated last year
- Local AWS - a lightweight AWS service emulator☆44Updated this week
- Map your python dataclasses to pyspark types☆10Feb 11, 2024Updated 2 years ago
- ☆10Apr 2, 2024Updated 2 years ago
- Streaming Generative AI Application on AWS☆14Jun 24, 2024Updated last year
- ☆13Feb 19, 2025Updated last year
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon …☆18Aug 25, 2021Updated 4 years ago
- Docker containers for building and testing Vertica extensions☆17Jan 7, 2026Updated 4 months ago
- ☆16Oct 18, 2023Updated 2 years ago
- ☆20Mar 13, 2025Updated last year
- Serverless Multi-Tenant Application on AWS Amplify☆17Jan 11, 2024Updated 2 years ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Nov 16, 2022Updated 3 years ago
- OpenID Shared Signals and Events (SSE) / Continuous Access Evaluation Protocol (CAEP) / Risk Incident Sharing and Coordination (RISC) JSO…☆15Jun 7, 2024Updated last year
- Sample application showcasing the use of Dapr to build microservices based apps☆15Feb 4, 2026Updated 3 months ago
- Solution to specify elastic and dynamic cloud resources as objects that can be easily referenced within AWS Network Firewall rules☆18Feb 19, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Terragrunt friendly module to create AWS API Gateway (V1) w\Optional WAF, many stages/api keys/usage plans using the OpenAPI 3.x spec. 🇺…☆11Feb 13, 2026Updated 3 months ago
- ☆23Feb 7, 2024Updated 2 years ago
- An example Terraform repo that utilizes the upstream EKS blueprints project from AWS Integration and Automation.☆14May 11, 2022Updated 4 years ago
- This is a collecton of CDK projects to show how to load data from streaming services into Amazon Redshift.☆13Sep 10, 2024Updated last year
- ☆23Nov 17, 2019Updated 6 years ago
- Custom kube-scheduler for binpacking targeting Spark on EKS and other jobs workloads☆29Feb 24, 2026Updated 2 months ago
- Local AWS EMR - A local service that imitates AWS EMR☆27Jul 5, 2023Updated 2 years ago