Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark
☆14Apr 14, 2023Updated 2 years ago
Alternatives and similar repositories for amazon-emr-optimize-data-processing
Users that are interested in amazon-emr-optimize-data-processing are comparing it to the libraries listed below
Sorting:
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Oct 17, 2023Updated 2 years ago
- Design best practices for building scalable ETL (Extract-Transform-Load) and ELT (Extract-Load-Transform) data processing pipelines using…☆17Dec 8, 2019Updated 6 years ago
- Can you set up a data warehouse and create a dashboard in under 60 minutes? In this workshop, we show you how with Amazon Redshift, a ful…☆29Jul 9, 2019Updated 6 years ago
- ☆17Apr 9, 2024Updated last year
- ☆12Aug 14, 2024Updated last year
- This repository has configuration files to set up an open-source tool named Okta AWS CLI Assume Role Tool (https://github.com/oktadevelop…☆10May 18, 2020Updated 5 years ago
- This sample code checks, and optionally creates, recommended CloudWatch alarms for your Amazon Elasticsearch service domain.☆13Feb 16, 2018Updated 8 years ago
- Use Amazon Lex as a conversational interface with Twilio Media Streams☆13Feb 20, 2026Updated last month
- In few hours, quickly learn how to effectively migrate oracle data warehouse workload to Amazon Redshift using AWS Schema Conversion Tool…☆10Dec 16, 2020Updated 5 years ago
- Sample code that reads Microsoft Excel workbook/CSV File for the details required to create a DMS task CloudFormation template☆14Jan 21, 2021Updated 5 years ago
- This repository contains sample code that is used to demonstrate building, deploying and invoking a SageMaker model for heart disease pre…☆10Oct 14, 2020Updated 5 years ago
- ☆13Aug 5, 2020Updated 5 years ago
- Amazon Sagemaker Object Detection Inference Endpoint visualization client web application hosted and managed by AWS Amplify.☆20Mar 28, 2024Updated last year
- AWS Workshop for learning Amazon Sagemaker☆12May 25, 2021Updated 4 years ago
- ☆10May 24, 2023Updated 2 years ago
- ☆13Jul 6, 2020Updated 5 years ago
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.☆53Mar 31, 2021Updated 4 years ago
- Automate Redshift cluster creation with best practices using AWS CloudFormation☆12Mar 3, 2022Updated 4 years ago
- Script to test an Amazon Lex bot using the Amazon Lex Runtime API.☆13Aug 14, 2020Updated 5 years ago
- This is a classic three-tier application written in Java 8 to easily upload and share photos, the application is just for demo purposes t…☆26Apr 14, 2023Updated 2 years ago
- ☆16Jun 27, 2020Updated 5 years ago
- Amazon Redshift offers a common query interface against data stored in fast, local storage as well as data from high-capacity, inexpensiv…☆13Nov 26, 2018Updated 7 years ago
- A walkthrough of setting up a Kinesis Data Analytics for Java Application which ingest streaming JSON data and leverages the Flink Table …☆16Aug 30, 2023Updated 2 years ago
- Sample code demonstrating Prometheus metrics ingestion into Amazon CloudWatch☆17Mar 4, 2022Updated 4 years ago
- ☆11Oct 11, 2022Updated 3 years ago
- AWS Amplify, Material UI☆16Jul 29, 2020Updated 5 years ago
- A Gentle introduction to Machine Learning with Apache Spark☆11Mar 2, 2026Updated 2 weeks ago
- A DMV based example application which demonstrates how to use QLDB with the QLDB Driver for Python.☆27Feb 6, 2024Updated 2 years ago
- Monitor how many EC2 instances are running across all regions with a simple dashboard.☆32Jan 25, 2022Updated 4 years ago
- ☆13May 7, 2021Updated 4 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Jul 12, 2022Updated 3 years ago
- This sample project illustrates how you can cache Amazon S3 objects within Amazon ElastiCache for Redis.☆21Mar 26, 2019Updated 6 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Dec 24, 2016Updated 9 years ago
- ☆17Jul 21, 2025Updated 8 months ago
- Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data str…☆47Dec 19, 2025Updated 3 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20May 13, 2020Updated 5 years ago
- ☆12Oct 16, 2023Updated 2 years ago
- A collection of old versions of the Haskell Report☆13Aug 17, 2017Updated 8 years ago
- Amazon Chime SDK and Amazon Connect Integration Demo☆19Oct 5, 2023Updated 2 years ago