This repository hold the Amazon Elastic MapReduce sample bootstrap actions
☆614Jun 5, 2023Updated 2 years ago
Alternatives and similar repositories for emr-bootstrap-actions
Users that are interested in emr-bootstrap-actions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Amazon Elastic MapReduce code samples☆63Sep 8, 2015Updated 10 years ago
- ☆894Jul 15, 2022Updated 3 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆120Mar 28, 2016Updated 10 years ago
- Amazon Redshift Database Loader implemented in AWS Lambda☆595Jul 16, 2024Updated last year
- Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment☆2,810Sep 3, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆328Mar 18, 2021Updated 5 years ago
- This repository hosts sample pipelines☆471May 8, 2020Updated 5 years ago
- Amazon Redshift Advanced Monitoring☆270Oct 28, 2025Updated 5 months ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Jun 15, 2023Updated 2 years ago
- REST job server for Apache Spark☆2,845Mar 3, 2026Updated last month
- Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB☆228Jan 15, 2026Updated 2 months ago
- Apache Spark on AWS Lambda☆158Dec 5, 2022Updated 3 years ago
- Redshift Python library for user agent detection (browsers, devices, etc) and parsing via UDFs☆10May 27, 2020Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆54Oct 3, 2023Updated 2 years ago
- A collection of example UDFs for Amazon Redshift.☆244Oct 25, 2024Updated last year
- Amazon Kinesis Client Library for Python☆376Dec 10, 2025Updated 4 months ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Feb 13, 2020Updated 6 years ago
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,361Sep 9, 2025Updated 7 months ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,150May 16, 2023Updated 2 years ago
- Client library for Amazon Kinesis☆660Updated this week
- functionstest☆33Oct 25, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Asynchronous Scala Clients for Amazon Web Services☆13Jul 31, 2017Updated 8 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,009Oct 5, 2022Updated 3 years ago
- AWS Glue code samples☆1,534Nov 5, 2025Updated 5 months ago
- Base classes to use when writing tests with Spark☆1,551Updated this week
- Secor is a service implementing Kafka log persistence☆1,859Mar 10, 2026Updated last month
- Mirror of Apache Toree (Incubating)☆749Apr 2, 2026Updated last week
- Scripts used to setup a Spark cluster on EC2☆387Nov 22, 2017Updated 8 years ago
- ☆760Mar 11, 2021Updated 5 years ago
- CLI tool to launch Spark jobs on AWS EMR☆67Oct 18, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆93Oct 1, 2020Updated 5 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- Deploy Spark cluster in an easy way.☆75Sep 13, 2016Updated 9 years ago
- Locality Sensitive Hashing for Apache Spark☆198Nov 1, 2016Updated 9 years ago
- Continuously monitors a set of log files and sends new data to the Amazon Kinesis Stream and Amazon Kinesis Firehose in near-real-time.☆373Mar 2, 2026Updated last month
- Read - Write JSON SerDe for Apache Hive.☆21Dec 4, 2018Updated 7 years ago
- AWS Lambda function to forward Stream data to Kinesis Firehose☆279Aug 16, 2023Updated 2 years ago