A best practices guide for using AWS EMR. The guide covers cost, performance, security, operational excellence, and reliability, along with application-specific best practices for Spark, Hive, Hudi, HBase, and more.
☆110Apr 5, 2026Updated last month
Alternatives and similar repositories for aws-emr-best-practices
Users that are interested in aws-emr-best-practices are comparing it to the libraries listed below.
- ☆23Feb 14, 2025Updated last year
- ☆45May 22, 2026Updated last week
- ☆26Apr 26, 2026Updated last month
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆39Feb 17, 2025Updated last year
- Spark Structured Streaming Kinesis Data Streams connector supports both GetRecords and SubscribeToShard (Enhanced Fan-Out, EFO)☆40May 19, 2026Updated last week
- Application to securely map users on a multi tenant Amazon EMR cluster to different IAM Roles and then assume the mapped Role.☆24Oct 24, 2023Updated 2 years ago
- This repository contains the dbt-glue adapter☆143May 1, 2026Updated 3 weeks ago
- ☆24Oct 3, 2023Updated 2 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆53Oct 31, 2023Updated 2 years ago
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'☆29Dec 22, 2020Updated 5 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Dec 5, 2023Updated 2 years ago
- ☆14Feb 26, 2024Updated 2 years ago
- ☆157Feb 29, 2024Updated 2 years ago
- Amazon EMR on EKS Custom Image CLI☆32Sep 26, 2024Updated last year
- Analyzing NBA Data☆12Feb 19, 2015Updated 11 years ago
- Deploy Jupyter Notebook to AWS Lambda☆16Nov 18, 2020Updated 5 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Jun 15, 2023Updated 2 years ago
- ☆22Oct 18, 2023Updated 2 years ago
- spark connector for Milvus☆16May 23, 2026Updated last week
- ☆17Oct 15, 2020Updated 5 years ago
- ☆21May 19, 2026Updated last week
- The dbt-spark-livy adapter allows you to use dbt along with Apache Spark, by connecting via Apache Livy☆12Mar 30, 2023Updated 3 years ago
- This solution combines Amazon Pinpoint with Amazon SageMaker to help automate the process of collecting customer data, predicting custom…☆17Dec 17, 2020Updated 5 years ago
- ☆23Sep 3, 2024Updated last year
- Scripts and instructions to facilitate running Deep Learning Tasks on Amazon EMR☆63Nov 9, 2023Updated 2 years ago
- Mirror of Apache Ranger☆15Apr 5, 2024Updated 2 years ago
- ☆17Mar 18, 2025Updated last year
- ☆12May 18, 2019Updated 7 years ago
- ☆17Dec 31, 2025Updated 4 months ago
- Example code for running Spark and Hive jobs on EMR Serverless.☆170May 14, 2026Updated 2 weeks ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
- Helper tool for migrating from Vaadin Framework 7 to 8☆10Aug 4, 2022Updated 3 years ago
- ☆75Jun 8, 2023Updated 2 years ago
- Openathon VI - Custom Software Engineering☆14Jan 9, 2025Updated last year
- Apache Spark build compatible with AWS Glue Data Catalog.☆19Aug 9, 2021Updated 4 years ago
- A companion for the LeanStacks YouTube channel playlist entitled Spring Security Fundamentals.☆12May 16, 2016Updated 10 years ago
- ☆10Apr 5, 2024Updated 2 years ago
- Performant Redshift data source for Apache Spark☆140Mar 17, 2026Updated 2 months ago
- aws-solutions-library-samples / guidance-for-natural-language-queries-of-relational-databases-on-aws: Demonstration of Natural Language Query (NLQ) of an Amazon RDS for PostgreSQL database, using SageMaker JumpStart, Amazon Bedrock, LangCh…☆72Oct 19, 2024Updated last year