aws-samples/emr-spark-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aws-samples/emr-spark-benchmark)

aws-samples / emr-spark-benchmark

☆26

Alternatives and similar repositories for emr-spark-benchmark

Users that are interested in emr-spark-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aws-samples / emr-on-eks-benchmark
View on GitHub
☆32Jul 2, 2026Updated 3 weeks ago
aws-samples / amazon-redshift-streaming-workshop
View on GitHub
This repository provides the resources required for the Amazon Redshift Streaming workshop
☆13Apr 13, 2026Updated 3 months ago
awslabs / migration-hadoop-to-emr-tco-simulator
View on GitHub
☆20May 21, 2024Updated 2 years ago
bluishglc / emr-edgenode-maker
View on GitHub
This tool can easily make / build an emr cluster edge node / client node / gateway node
☆10Jun 1, 2022Updated 4 years ago
awslabs / aws-glue-streaming-libs
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
aws-samples / emr-remote-shuffle-service
View on GitHub
☆18May 7, 2026Updated 2 months ago
aws / aws-emr-best-practices
View on GitHub
A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…
☆110Apr 5, 2026Updated 3 months ago
aws-samples / emr-serverless-samples
View on GitHub
Example code for running Spark and Hive jobs on EMR Serverless.
☆171Jul 8, 2026Updated 2 weeks ago
aws-samples / aws-glue-streaming-ingestion-from-kafka-to-apache-iceberg
View on GitHub
This is a collecton of Amazon CDK projects to show how to directly ingest streaming data from Amazon Mananged Service for Apache Kafka (M…
☆16Sep 10, 2024Updated last year
aws-samples / retail-large-data-ml-e2e
View on GitHub
小売業で予測ベースの発注を実現するためのサンプルソリューション
☆17Apr 10, 2025Updated last year
aws-samples / eks-event-watcher
View on GitHub
☆13Aug 12, 2022Updated 3 years ago
aws-samples / emr-studio-notebook-examples
View on GitHub
This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.
☆53Oct 31, 2023Updated 2 years ago
aws-ia / terraform-aws-eks-data-addons
View on GitHub
Terraform Module: Deploy Data/ML Addons Helm Charts on EKS 🚀
☆46Aug 7, 2025Updated 11 months ago
aws-samples / aws-emr-utilities
View on GitHub
☆45Updated this week
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
awsdocs / amazon-emr-management-guide
View on GitHub
The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…
☆62Jun 15, 2023Updated 3 years ago
logan-bobo / rds_auto_encrypt
View on GitHub
Encrypt RDS instances that were previously created in an unencrypted state
☆11Aug 27, 2022Updated 3 years ago
aws-samples / amazon-redshift-infrastructure-automation
View on GitHub
☆26Aug 8, 2024Updated last year
aws-samples / observability-driven-development
View on GitHub
☆12Dec 7, 2025Updated 7 months ago
kaloureyes3 / v4-clients
View on GitHub
☆10Apr 5, 2024Updated 2 years ago
opensearch-project / time-series-db
View on GitHub
OpenSearch-TSDB
☆15Jul 16, 2026Updated last week
aws-samples / aws-emr-apache-ranger
View on GitHub
☆24Oct 3, 2023Updated 2 years ago
JerryLead / SparkProfiler
View on GitHub
Profiling Spark Applications for Performance Comparison and Diagnosis
☆16Nov 11, 2018Updated 7 years ago
aws-john / simple-lambda-stopinator-for-ec2
View on GitHub
Simple Lambda Stopinator: Start/Stop EC2 instances on schedule or duration
☆11May 25, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
angelmaroco / spark-on-aws-eks-with-karpenter
View on GitHub
WIP - Scaling Spark Data Platform with EKS. The solution uses Karpenter and Cluster Autoscaler, Yunikorn for advanced scheduling.
☆16May 9, 2023Updated 3 years ago
awslabs / aws-proton-plugins-for-backstage
View on GitHub
☆48Mar 4, 2024Updated 2 years ago
aws-samples / emr-trino-autoscale
View on GitHub
☆23Feb 14, 2025Updated last year
coroot / coroot-aws-agent
View on GitHub
A prometheus exporter that gathers metrics from AWS services.
☆19Oct 16, 2025Updated 9 months ago
olympiaformat / olympia
View on GitHub
Olympia is a storage-only open catalog format for big data analytics, ML & AI.
☆16May 5, 2025Updated last year
aws-samples / amazon-connect-contact-lens-rules-library
View on GitHub
☆18Jan 8, 2024Updated 2 years ago
aws-samples / amazon-dynamodb-item-tagging
View on GitHub
☆18Jun 20, 2024Updated 2 years ago
aws / Unified-Studio-for-Amazon-Sagemaker
View on GitHub
☆57Apr 21, 2026Updated 3 months ago
aws-samples / amazon-managed-service-for-apache-flink-examples
View on GitHub
Collection of code examples for Amazon Managed Service for Apache Flink
☆90Jun 16, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aws-samples / aws-fargate-falco-examples
View on GitHub
☆14May 19, 2023Updated 3 years ago
kubeflow / mcp-apache-spark-history-server
View on GitHub
MCP Server and CLI for Apache Spark History Server. Debug Spark applications from AI agents, scripts, or the terminal.
☆183Jul 16, 2026Updated last week
awslabs / s3-tables-catalog
View on GitHub
The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…
☆154Jan 26, 2026Updated 5 months ago
aws-samples / eks-spark-benchmark
View on GitHub
Performance optimization for Spark running on Kubernetes
☆87Aug 18, 2020Updated 5 years ago
prosto / ray-haystack
View on GitHub
Run Haystack Pipelines on Ray
☆20Oct 16, 2024Updated last year
AdminTurnedDevOps / agentic-demo-repo
View on GitHub
Example code/configs for all frameworks (CrewAI, kagent, ADK, etc.), MCP, fine-tuning, and AI gateways
☆43Updated this week
spark-redshift-community / spark-redshift
View on GitHub
Performant Redshift data source for Apache Spark
☆140Jun 5, 2026Updated last month