aws-samples/amazon-emr-optimize-data-processing

Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark

☆14

Alternatives and similar repositories for amazon-emr-optimize-data-processing

Users who are interested in amazon-emr-optimize-data-processing are comparing it to the repositories listed below.

aws-samples / aws-blog-redshift-datalake-etl-elt-patterns
Design best practices for building scalable ETL (Extract-Transform-Load) and ELT (Extract-Load-Transform) data processing pipelines using…
☆17 · Dec 8, 2019 · Updated 6 years ago
aws-samples / amazon-redshift-modernize-dw
Can you set up a data warehouse and create a dashboard in under 60 minutes? In this workshop, we show you how with Amazon Redshift, a ful…
☆29 · Jul 9, 2019 · Updated 7 years ago
aws-samples / amazon-es-check-cw-alarms
This sample code checks, and optionally creates, recommended CloudWatch alarms for your Amazon Elasticsearch Service domain.
☆13 · Feb 16, 2018 · Updated 8 years ago
aws-samples / sql-based-etl-on-amazon-eks
☆17 · Apr 9, 2024 · Updated 2 years ago
aws-samples / amazon-kinesis-analytics-streaming-etl
Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics
☆65 · Oct 17, 2023 · Updated 2 years ago
aws-samples / aws-cdk-emr-s3-trigger
☆12 · Aug 14, 2024 · Updated last year
aws-samples / eks-rbac-sso
This repository has configuration files to set up an open-source tool named Okta AWS CLI Assume Role Tool (https://github.com/oktadevelop…
☆10 · May 18, 2020 · Updated 6 years ago
aws-samples / amazon-lex-conversational-interface-for-twilio
Use Amazon Lex as a conversational interface with Twilio Media Streams
☆13 · Feb 20, 2026 · Updated 5 months ago
aws-samples / amazon-redshift-data-warehouse-migration
In a few hours, learn how to effectively migrate Oracle data warehouse workloads to Amazon Redshift using AWS Schema Conversion Tool…
☆10 · Dec 16, 2020 · Updated 5 years ago
aws-samples / aws-sagemaker-heart-disease-prediction
This repository contains sample code that is used to demonstrate building, deploying and invoking a SageMaker model for heart disease pre…
☆10 · Oct 14, 2020 · Updated 5 years ago
aws-samples / dms-cloudformation-templates-generator
Sample code that reads a Microsoft Excel workbook or CSV file for the details required to create a DMS task CloudFormation template
☆14 · Jan 21, 2021 · Updated 5 years ago
aws-samples / enterprise-search-with-amazon-kendra-workshop
☆13 · Aug 5, 2020 · Updated 5 years ago
awslabs / amazon-sagemaker-inference-client
Amazon SageMaker Object Detection Inference Endpoint visualization client web application hosted and managed by AWS Amplify.
☆20 · Mar 28, 2024 · Updated 2 years ago
aws-samples / sagemaker-workshop
AWS Workshop for learning Amazon SageMaker
☆12 · May 25, 2021 · Updated 5 years ago
aws-samples / simple-phonebook-web-application
☆11 · May 24, 2023 · Updated 3 years ago
aws-samples / redshift-immersionday-labs
This GitHub project provides a series of lab exercises that help users get started using the Redshift platform.
☆53 · Mar 31, 2021 · Updated 5 years ago
aws-samples / amazon-redshift-with-cloudformation
Automate Redshift cluster creation with best practices using AWS CloudFormation
☆12 · Mar 3, 2022 · Updated 4 years ago
aws-samples / amazon-lex-bot-test
Script to test an Amazon Lex bot using the Amazon Lex Runtime API.
☆13 · Aug 14, 2020 · Updated 5 years ago
richardanaya / spark_delta_lake
☆16 · Jun 27, 2020 · Updated 6 years ago
aws-samples / amazon-redshift-tiered-storage
Amazon Redshift offers a common query interface against data stored in fast, local storage as well as data from high-capacity, inexpensiv…
☆13 · Nov 26, 2018 · Updated 7 years ago
aws-samples / amazon-kinesis-data-analytics-flinktableapi
A walkthrough of setting up a Kinesis Data Analytics for Java Application that ingests streaming JSON data and leverages the Flink Table …
☆16 · Aug 30, 2023 · Updated 2 years ago
aws-samples / amazon-sagemaker-analyze-model-predictions
☆25 · Jan 5, 2021 · Updated 5 years ago
aws-samples / amazon-chime-sdk-pstn-integration
☆16 · Jul 6, 2020 · Updated 6 years ago
awslabs / amazon-s3-tagging-spark-util
☆12 · Oct 16, 2023 · Updated 2 years ago
aws-samples / amazon-cloudwatch-prometheus-metrics-sample
Sample code demonstrating Prometheus metrics ingestion into Amazon CloudWatch
☆17 · Mar 4, 2022 · Updated 4 years ago
alexbeletsky / tdd.demand
A web crawler that crawls some job-search sites, analyzes them, and stores the data
☆19 · Jun 5, 2012 · Updated 14 years ago
aws-samples / dbtgluenyctaxidemo
☆11 · Oct 11, 2022 · Updated 3 years ago
newfront / spark-intro-to-ml
A gentle introduction to machine learning with Apache Spark
☆11 · Mar 2, 2026 · Updated 4 months ago
amazon-archives / aws-amplify-material-ui-js-demo
AWS Amplify, Material UI
☆16 · Jul 29, 2020 · Updated 5 years ago
joomcode / spark-platform
Basic Spark utilities
☆13 · Feb 20, 2025 · Updated last year
aws-samples / amazon-qldb-dmv-sample-python
A DMV-based example application that demonstrates how to use QLDB with the QLDB Driver for Python.
☆27 · Feb 6, 2024 · Updated 2 years ago
aws-samples / amazon-kinesis-analytics-beam-taxi-consumer
Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data str…
☆48 · Dec 19, 2025 · Updated 7 months ago
aws-samples / amplify-vue-search-example
☆13 · May 7, 2021 · Updated 5 years ago
aws-samples / streaming-analytics-workshop
Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time
☆34 · Jul 12, 2022 · Updated 4 years ago
jamartinh / Orange3-Spark
A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML
☆15 · Dec 24, 2016 · Updated 9 years ago
aws-samples / amazon-S3-cache-with-amazon-elasticache-redis
This sample project illustrates how you can cache Amazon S3 objects within Amazon ElastiCache for Redis.
☆21 · Mar 26, 2019 · Updated 7 years ago
aws-samples / build-a-360-degree-customer-view-with-aws
☆17 · Jul 21, 2025 · Updated last year
aws-samples / data-profiler-for-aws-glue-data-catalog
Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…
☆20 · May 13, 2020 · Updated 6 years ago
aws-samples / amazon-chime-sdk-amazon-connect-integration-demo
Amazon Chime SDK and Amazon Connect Integration Demo
☆19 · Oct 5, 2023 · Updated 2 years ago