dimajix / terraform-emr-trainingLinks
Terraform script for launching multiple EMR clusters for training purposes.
☆16Updated last week
Alternatives and similar repositories for terraform-emr-training
Users that are interested in terraform-emr-training are comparing it to the libraries listed below
Sorting:
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 7 years ago
- Kubeflow workshop on EKS. Mainly focus on AWS integration examples. Please go check kubeflow website http://kubeflow.org for other exampl…☆99Updated 4 years ago
- A toolset to streamline running spark python on EMR☆20Updated 8 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker…☆84Updated 2 years ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- AWS Lambda function to ingest application logs from S3 Buckets into ElasticSearch for indexing☆59Updated 4 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆25Updated 7 years ago
- Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS☆74Updated last month
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated 2 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆25Updated 2 years ago
- Terraform provider for interacting with NiFi cluster☆51Updated 6 years ago
- Puppet module to provision Airbnb's Airflow☆20Updated 3 years ago
- ☆21Updated 4 months ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆77Updated 7 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- ☆14Updated 4 years ago
- AWS Quick Start Team☆60Updated last year
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 10 months ago
- A collection of airflow sample workflows for data processing on aws☆12Updated 7 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Amazon ECS Interstella Workshops CON209/318/319/407☆55Updated 5 years ago
- In this pattern, data records are ingested and then modified with simple transformations such as field level substitutions and data enric…☆14Updated 6 years ago
- A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.☆39Updated 6 years ago
- Python library for AWS pricing.☆216Updated 2 years ago
- ☆14Updated 5 years ago
- Ansible playbooks for Apache Spark on kube☆27Updated 8 years ago
- Curated list of resources about Apache Airflow☆19Updated 4 years ago
- This service is meant to simplify running Google Cloud operations, especially BigQuery tasks. This means you do not have to worry about …☆46Updated 6 years ago
- Cloudformation templates for deploying Airflow in ECS☆40Updated 6 years ago