dimajix / terraform-emr-training
Terraform script for launching multiple EMR clusters for training purposes.
☆16Updated last month
Alternatives and similar repositories for terraform-emr-training
Users who are interested in terraform-emr-training are comparing it to the repositories listed below.
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 7 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 11 months ago
- A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.☆39Updated 6 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated 2 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu…☆77Updated 7 years ago
- AWS Lambda function to ingest application logs from S3 Buckets into ElasticSearch for indexing☆58Updated 4 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS☆74Updated 2 months ago
- Kubeflow workshop on EKS. Mainly focus on AWS integration examples. Please go check kubeflow website http://kubeflow.org for other exampl…☆99Updated 4 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆25Updated 2 years ago
- ☆14Updated 4 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated 2 years ago
- ☆22Updated 5 years ago
- Ansible role to install Apache Airflow☆86Updated 3 months ago
- ☆13Updated 5 years ago
- A toolset to streamline running spark python on EMR☆20Updated 9 years ago
- AWS Quick Start Team☆60Updated last year
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz…☆28Updated 6 years ago
- Secure Amazon ElasticSearch with AD/LDAP based Authentication & Authorization☆19Updated 7 years ago
- Reference Architectures for Datalakes on AWS☆78Updated 5 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- ☆19Updated 4 years ago
- The sane way of building a data layer in Airflow☆24Updated 6 years ago
- ☆52Updated 8 years ago
- This workshop is meant to give customers a hands-on experience with mentioned AWS services. Serverless Data Lake workshop helps customer…☆37Updated 4 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 3 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆25Updated 7 years ago
- A solution for near real-time monitoring of replication of objects in Amazon S3 between a source bucket and a destination bucket across m…☆39Updated 2 years ago
- AWS Quick Start Team☆19Updated last year