This repository hold the Amazon Elastic MapReduce sample bootstrap actions
☆613Jun 5, 2023Updated 2 years ago
Alternatives and similar repositories for emr-bootstrap-actions
Users that are interested in emr-bootstrap-actions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Amazon Elastic MapReduce code samples☆63Sep 8, 2015Updated 10 years ago
- ☆894Jul 15, 2022Updated 3 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆120Mar 28, 2016Updated 9 years ago
- Amazon Redshift Database Loader implemented in AWS Lambda☆595Jul 16, 2024Updated last year
- Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment☆2,811Sep 3, 2025Updated 6 months ago
- ☆328Mar 18, 2021Updated 5 years ago
- This repository hosts sample pipelines☆470May 8, 2020Updated 5 years ago
- Amazon Redshift Advanced Monitoring☆270Oct 28, 2025Updated 4 months ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Jun 15, 2023Updated 2 years ago
- REST job server for Apache Spark☆2,845Mar 3, 2026Updated 2 weeks ago
- Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB☆228Jan 15, 2026Updated 2 months ago
- Apache Spark on AWS Lambda☆157Dec 5, 2022Updated 3 years ago
- Compute the optimal number of shards for your Kinesis stream☆18Jan 10, 2019Updated 7 years ago
- Redshift Python library for user agent detection (browsers, devices, etc) and parsing via UDFs☆10May 27, 2020Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- A collection of example UDFs for Amazon Redshift.☆244Oct 25, 2024Updated last year
- Amazon Kinesis Client Library for Python☆376Dec 10, 2025Updated 3 months ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Feb 13, 2020Updated 6 years ago
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,362Sep 9, 2025Updated 6 months ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,147May 16, 2023Updated 2 years ago
- Client library for Amazon Kinesis☆660Updated this week
- functionstest☆33Oct 25, 2016Updated 9 years ago
- Asynchronous Scala Clients for Amazon Web Services☆13Jul 31, 2017Updated 8 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Oct 5, 2022Updated 3 years ago
- AWS Glue code samples☆1,535Nov 5, 2025Updated 4 months ago
- Base classes to use when writing tests with Spark☆1,549Dec 22, 2025Updated 3 months ago
- Secor is a service implementing Kafka log persistence☆1,857Mar 10, 2026Updated last week
- Mirror of Apache Toree (Incubating)☆749Mar 9, 2026Updated 2 weeks ago
- Scripts used to setup a Spark cluster on EC2☆387Nov 22, 2017Updated 8 years ago
- ☆760Mar 11, 2021Updated 5 years ago
- CLI tool to launch Spark jobs on AWS EMR☆67Oct 18, 2023Updated 2 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆93Oct 1, 2020Updated 5 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- Locality Sensitive Hashing for Apache Spark☆198Nov 1, 2016Updated 9 years ago
- Continuously monitors a set of log files and sends new data to the Amazon Kinesis Stream and Amazon Kinesis Firehose in near-real-time.☆372Mar 2, 2026Updated 3 weeks ago
- Read - Write JSON SerDe for Apache Hive.☆21Dec 4, 2018Updated 7 years ago
- AWS Lambda function to forward Stream data to Kinesis Firehose☆279Aug 16, 2023Updated 2 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago