A best practices guide for using AWS EMR. The guide covers cost, performance, security, operational excellence, and reliability, along with application-specific best practices for Spark, Hive, Hudi, HBase, and more.
☆110 · Mar 24, 2026 · Updated this week
Alternatives and similar repositories for aws-emr-best-practices
Users who are interested in aws-emr-best-practices are comparing it to the libraries listed below.
- ☆23 · Feb 14, 2025 · Updated last year
- ☆43 · Mar 21, 2026 · Updated last week
- Best practices and recommendations for getting started with Amazon EMR on EKS. ☆68 · Jan 27, 2026 · Updated 2 months ago
- ☆26 · Mar 12, 2024 · Updated 2 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆39 · Feb 17, 2025 · Updated last year
- Spark Structured Streaming Kinesis Data Streams connector supports both GetRecords and SubscribeToShard (Enhanced Fan-Out, EFO) ☆39 · Mar 2, 2026 · Updated 3 weeks ago
- An Apache Spark Structured Streaming S3 connector for reading S3 files using Amazon S3 event notifications to AWS SQS ☆15 · Feb 13, 2024 · Updated 2 years ago
- Application to securely map users on a multi tenant Amazon EMR cluster to different IAM Roles and then assume the mapped Role. ☆24 · Oct 24, 2023 · Updated 2 years ago
- ☆24 · Oct 3, 2023 · Updated 2 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio. ☆53 · Oct 31, 2023 · Updated 2 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog ☆35 · Dec 5, 2023 · Updated 2 years ago
- ☆18 · Nov 4, 2024 · Updated last year
- ☆13 · Feb 26, 2024 · Updated 2 years ago
- ☆157 · Feb 29, 2024 · Updated 2 years ago
- ☆32 · Jan 30, 2026 · Updated 2 months ago
- Amazon EMR on EKS Custom Image CLI ☆32 · Sep 26, 2024 · Updated last year
- Simple secret module for AWS Secrets Manager ☆10 · Aug 16, 2022 · Updated 3 years ago
- Analyzing NBA Data ☆11 · Feb 19, 2015 · Updated 11 years ago
- ☆56 · Mar 18, 2026 · Updated last week
- A UI client on top of fhir-works-on-aws-deployment ☆15 · Apr 3, 2023 · Updated 2 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r… ☆62 · Jun 15, 2023 · Updated 2 years ago
- spark connector for Milvus ☆16 · Jan 19, 2026 · Updated 2 months ago
- A DynamoDB implementation of the FHIR Works on AWS framework, enabling users to complete CRUD operations on FHIR resources ☆27 · Apr 13, 2023 · Updated 2 years ago
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3 ☆26 · Sep 10, 2024 · Updated last year
- This solution combines Amazon Pinpoint with Amazon SageMaker to help automate the process of collecting customer data, predicting custom… ☆17 · Dec 17, 2020 · Updated 5 years ago
- ☆23 · Sep 3, 2024 · Updated last year
- Public README ☆13 · Aug 2, 2025 · Updated 7 months ago
- Mirror of Apache Ranger ☆15 · Apr 5, 2024 · Updated last year
- Example code for running Spark and Hive jobs on EMR Serverless. ☆169 · Mar 11, 2026 · Updated 2 weeks ago
- ☆25 · Jul 4, 2023 · Updated 2 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis ☆17 · Nov 11, 2018 · Updated 7 years ago
- Helper tool for migrating from Vaadin Framework 7 to 8 ☆10 · Aug 4, 2022 · Updated 3 years ago
- Apache Spark build compatible with AWS Glue Data Catalog. ☆19 · Aug 9, 2021 · Updated 4 years ago
- A companion for the LeanStacks YouTube channel playlist entitled Spring Security Fundamentals. ☆12 · May 16, 2016 · Updated 9 years ago
- ☆10 · Apr 5, 2024 · Updated last year
- This GenAI solution enables users to extract insights from diverse data formats (video, audio, PDFs, text) through a unified interface. U… ☆17 · Feb 12, 2026 · Updated last month
- Performant Redshift data source for Apache Spark ☆140 · Mar 17, 2026 · Updated last week
- aws-solutions-library-samples / guidance-for-natural-language-queries-of-relational-databases-on-aws: Demonstration of Natural Language Query (NLQ) of an Amazon RDS for PostgreSQL database, using SageMaker JumpStart, Amazon Bedrock, LangCh… ☆72 · Oct 19, 2024 · Updated last year
- ☆10 · Dec 13, 2023 · Updated 2 years ago