aws-samples/emr-bootstrap-actions


aws-samples / emr-bootstrap-actions

This repository holds the Amazon Elastic MapReduce sample bootstrap actions

☆613

Alternatives and similar repositories for emr-bootstrap-actions

Users who are interested in emr-bootstrap-actions are comparing it to the libraries listed below.


amazon-archives / emr-sample-apps
Amazon Elastic MapReduce code samples
☆63 · Sep 8, 2015 · Updated 10 years ago
databricks / spark-redshift
Redshift data source for Apache Spark
☆608 · Aug 10, 2023 · Updated 2 years ago
aws-samples / aws-big-data-blog
☆893 · Jul 15, 2022 · Updated 4 years ago
databricks / spark-avro
Avro Data Source for Apache Spark
☆537 · Dec 19, 2018 · Updated 7 years ago
snowplow-archive / spark-example-project
A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR
☆119 · Mar 28, 2016 · Updated 10 years ago
awslabs / aws-lambda-redshift-loader
Amazon Redshift Database Loader implemented in AWS Lambda
☆595 · Jul 16, 2024 · Updated 2 years ago
databricks / spark-csv
CSV Data Source for Apache Spark 1.x
☆1,057 · Dec 13, 2018 · Updated 7 years ago
awslabs / amazon-redshift-utils
Amazon Redshift Utils contains utilities, scripts, and views that are useful in a Redshift environment
☆2,811 · Sep 3, 2025 · Updated 10 months ago
awsdocs / amazon-emr-management-guide
The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…
☆62 · Jun 15, 2023 · Updated 3 years ago
amazon-archives / amazon-kinesis-connectors
☆327 · Mar 18, 2021 · Updated 5 years ago
amazon-archives / data-pipeline-samples
This repository hosts sample pipelines
☆472 · May 8, 2020 · Updated 6 years ago
awslabs / amazon-redshift-monitoring
Amazon Redshift Advanced Monitoring
☆269 · Oct 28, 2025 · Updated 8 months ago
Cascading / scalding-tutorial
The Scalding tutorial as a standalone SBT project
☆51 · Oct 16, 2017 · Updated 8 years ago
awslabs / emr-dynamodb-connector
Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB
☆228 · Apr 8, 2026 · Updated 3 months ago
adobe-research / spark-cluster-deployment
Automates Spark standalone cluster tasks with Puppet and Fabric.
☆43 · Aug 14, 2014 · Updated 11 years ago
spark-jobserver / spark-jobserver
REST job server for Apache Spark
☆2,837 · Mar 3, 2026 · Updated 4 months ago
databricks / spark-knowledgebase
Spark Knowledge Base
☆333 · Oct 1, 2020 · Updated 5 years ago
ajlai / uap-redshift
Redshift Python library for user agent detection (browsers, devices, etc.) and parsing via UDFs
☆10 · May 27, 2020 · Updated 6 years ago
qubole / spark-on-lambda
Apache Spark on AWS Lambda
☆158 · Dec 5, 2022 · Updated 3 years ago
ComcastSamples / KinesisShardCalculator
Compute the optimal number of shards for your Kinesis stream
☆18 · Jan 10, 2019 · Updated 7 years ago
awslabs / aws-emr-launch
☆54 · Oct 3, 2023 · Updated 2 years ago
aws-samples / amazon-redshift-udfs
A collection of example UDFs for Amazon Redshift.
☆244 · Updated this week
mozilla / emr-bootstrap-spark
AWS bootstrap scripts for Mozilla's flavoured Spark setup.
☆47 · Feb 13, 2020 · Updated 6 years ago
amplab / keystone
Simplifying robust end-to-end machine learning on Apache Spark.
☆473 · Apr 18, 2017 · Updated 9 years ago
awslabs / amazon-kinesis-client-python
Amazon Kinesis Client Library for Python
☆376 · Jun 2, 2026 · Updated last month
awslabs / amazon-kinesis-client
Client library for Amazon Kinesis
☆664 · Jul 15, 2026 · Updated last week
jupyter-incubator / sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
☆1,364 · Sep 9, 2025 · Updated 10 months ago
spark-notebook / spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
☆3,142 · May 16, 2023 · Updated 3 years ago
amazon-archives / kinesis-storm-spout
Kinesis spout for Storm
☆108 · Mar 28, 2018 · Updated 8 years ago
yodasco / pyspark-emr
A toolset to streamline running Spark Python jobs on EMR
☆20 · Nov 16, 2016 · Updated 9 years ago
cloudera / livy
Livy is an open source REST interface for interacting with Apache Spark from anywhere
☆1,008 · Oct 5, 2022 · Updated 3 years ago
aws-samples / aws-glue-samples
AWS Glue code samples
☆1,539 · Jun 8, 2026 · Updated last month
hbutani / spark-datetime
functionstest
☆33 · Oct 25, 2016 · Updated 9 years ago
jwplayer / sparksteps
CLI tool to launch Spark jobs on AWS EMR
☆67 · Oct 18, 2023 · Updated 2 years ago
holdenk / spark-testing-base
Base classes to use when writing tests with Spark
☆1,553 · Apr 20, 2026 · Updated 3 months ago
mingchuno / aws-wrap
Asynchronous Scala Clients for Amazon Web Services
☆13 · Jul 31, 2017 · Updated 8 years ago
apache / incubator-toree
Mirror of Apache Toree (Incubating)
☆751 · Updated this week
hakanilter / aws-emr-examples
Some AWS EMR examples
☆16 · Jan 18, 2018 · Updated 8 years ago
pinterest / secor
Secor is a service implementing Kafka log persistence
☆1,857 · Mar 10, 2026 · Updated 4 months ago