lynnlangit / Spark-Scala-EKS
Spark Scala docker container sample for AWS testing - EKS & S3
☆24Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Spark-Scala-EKS
- ☆45Updated 6 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated last year
- Docker image to submit Spark applications☆38Updated 6 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated 7 months ago
- In-deprecation. For Lenses please check lensesio/lenses-helm-charts. Soon Stream Reactor will also get its own Helm repository.☆70Updated 4 years ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆70Updated 9 months ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- Terraform script for launching multiple EMR clusters for training purposes.☆16Updated last year
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆47Updated 11 months ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆61Updated last year
- A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.☆39Updated 5 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆76Updated 6 years ago
- Automatically loads new partitions in AWS Athena☆18Updated 4 years ago
- Bash completion for Kafka command line utilities.☆34Updated 6 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated last year
- Amazon Elastic MapReduce code samples☆63Updated 9 years ago
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.☆27Updated 7 years ago
- Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS☆72Updated last month
- Deploy Presto on the cloud easily, using Terraform and Packer☆44Updated last year
- A Helm Chart for Apache Airflow☆14Updated 6 years ago
- Minikube for big data with Scala and Spark☆15Updated 5 years ago
- ☆30Updated last year
- An application that records stats about consumer group offset commits and reports them as prometheus metrics☆14Updated 5 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆66Updated 9 months ago