This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenarios using best practices. The code can be deployed into any Spark compatible engine like Amazon EMR Serverless or AWS Glue. A fully local developer environment is also provided.
☆27Mar 17, 2026Updated last month
Alternatives and similar repositories for iceberg-streaming-examples
Users that are interested in iceberg-streaming-examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Nov 26, 2024Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Aug 18, 2024Updated last year
- ☆19Nov 18, 2025Updated 5 months ago
- A Python CLI application that demonstrates how you can access AWS services, such as Amazon S3 and Amazon Athena, using trusted identity p…☆13Mar 11, 2025Updated last year
- RGCN model for real-time fraud detection☆11Jan 27, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- DS4Windows is an open-source gamepad input mapper and virtual emulator designed to use and connect your PlayStation controller (DualShock…☆43Mar 1, 2026Updated 2 months ago
- Visits sessionization pipeline used for the talk☆13May 28, 2024Updated last year
- Automate rule management for AWS Network Firewall☆17Apr 24, 2026Updated last week
- ☆10Jun 29, 2021Updated 4 years ago
- ☆11Oct 19, 2023Updated 2 years ago
- ☆32Jan 30, 2026Updated 3 months ago
- ☆17Nov 26, 2024Updated last year
- Python Module to use the Readwise API☆20Jan 24, 2026Updated 3 months ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Jun 18, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆48Mar 23, 2026Updated last month
- Collection of Cloud Formation Templates, Lambda Scripts and sample code required to provision an AWS Data Lake for a ReInvent Lab Exercis…☆26Apr 9, 2019Updated 7 years ago
- CCCS security control profiles expressed using OSCAL☆21Oct 6, 2025Updated 6 months ago
- Sample of a morse-code 'blinky' in different architectures☆18Oct 9, 2023Updated 2 years ago
- Local AWS - a lightweight AWS service emulator☆43Apr 26, 2026Updated last week
- ☆10Apr 2, 2024Updated 2 years ago
- A collection of pipelines for Scrapy☆16Mar 30, 2026Updated last month
- Streaming Generative AI Application on AWS☆14Jun 24, 2024Updated last year
- ☆13Feb 19, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- PowerShell function to indicate progress, using cli-spinner icons, during longer running tasks.☆13Mar 31, 2021Updated 5 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- End to end data pipeline☆22Apr 13, 2025Updated last year
- This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon …☆18Aug 25, 2021Updated 4 years ago
- ☆16Oct 18, 2023Updated 2 years ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs☆29Apr 4, 2026Updated 3 weeks ago
- ☆20Mar 13, 2025Updated last year
- Serverless Multi-Tenant Application on AWS Amplify☆17Jan 11, 2024Updated 2 years ago
- Real-time OLTP system for credit card fraud detection using AWS API Gateway, Kinesis, and RDS PostgreSQL. Features a scalable, serverless…☆24Dec 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- OpenID Shared Signals and Events (SSE) / Continuous Access Evaluation Protocol (CAEP) / Risk Incident Sharing and Coordination (RISC) JSO…☆15Jun 7, 2024Updated last year
- Sample application showcasing the use of Dapr to build microservices based apps☆15Feb 4, 2026Updated 2 months ago
- Terraform project to deploy Jenkins on ECS Fargate with Jenkins configuration stored in EFS and agents on ECS fargate☆17Feb 18, 2024Updated 2 years ago
- Solution to specify elastic and dynamic cloud resources as objects that can be easily referenced within AWS Network Firewall rules☆18Feb 19, 2025Updated last year
- Terragrunt friendly module to create AWS API Gateway (V1) w\Optional WAF, many stages/api keys/usage plans using the OpenAPI 3.x spec. 🇺…☆11Feb 13, 2026Updated 2 months ago
- Custom kube-scheduler for binpacking targeting Spark on EKS and other jobs workloads☆28Feb 24, 2026Updated 2 months ago
- ☆14Dec 24, 2025Updated 4 months ago