dimajix / terraform-emr-trainingLinks
Terraform script for launching multiple EMR clusters for training purposes.
☆16Updated last year
Alternatives and similar repositories for terraform-emr-training
Users that are interested in terraform-emr-training are comparing it to the libraries listed below
Sorting:
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 6 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as…☆22Updated 6 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated 2 years ago
- Kubeflow workshop on EKS. Mainly focus on AWS integration examples. Please go check kubeflow website http://kubeflow.org for other exampl…☆98Updated 4 years ago
- ☆21Updated 2 months ago
- ☆14Updated 4 years ago
- A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.☆39Updated 5 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- AWS Lambda function to ingest application logs from S3 Buckets into ElasticSearch for indexing☆59Updated 4 years ago
- ☆13Updated 5 years ago
- AWS Quick Start Team☆19Updated 11 months ago
- In this pattern, data records are ingested and then modified with simple transformations such as field level substitutions and data enric…☆14Updated 6 years ago
- A solution describing data-processing design pattern for streaming data through Kinesis and Spark Streaming at real-time.☆38Updated last year
- This workshop is meant to give customers a hands-on experience with mentioned AWS services. Serverless Data Lake workshop helps customer…☆37Updated 4 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- Material for re:Invent 2017 - CMP316 - Workshop: Hedge Your Own Funds: Run Monte Carlo Simulations on Amazon EC2 Spot Fleets☆43Updated 5 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 7 months ago
- ☆22Updated 5 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated last year
- Code accompanying AWS re:Invent workshop DEV 303 showcasing how to get deep application insights using Amazon EKS with AWS X-Ray and Amaz…☆57Updated 3 years ago
- Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data str…☆47Updated last year
- S3 Snapshot script to run from command-line or scheduled in Lambda.☆28Updated 6 years ago
- AWS Quick Start Team☆23Updated 11 months ago
- Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS☆74Updated last month
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 3 years ago
- ☆73Updated last year
- Reference Architectures for Datalakes on AWS☆78Updated 5 years ago
- Sagemaker pipeline for AWS Summit New York☆58Updated 5 years ago
- A sample AWS Step Functions (SFN) state machine, designed to one-way synchronize an Amazon S3 source bucket into another S3 destination b…☆111Updated 6 years ago