fscm / terraform-module-aws-spark
Terraform Module to create a Apache Spark cluster on AWS
☆16Updated 3 years ago
Alternatives and similar repositories for terraform-module-aws-spark:
Users that are interested in terraform-module-aws-spark are comparing it to the libraries listed below
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 6 years ago
- GitHub Action That Submits Argo Workflows For Execution on Your GKE Cluster☆16Updated 4 years ago
- A Pythonic API for Amazon's States Language for defining AWS Step Functions☆8Updated 2 years ago
- Docker image for Apache Hive running on Tez☆7Updated 10 years ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Example code from my Futures and Observables presentation☆21Updated 11 years ago
- A library for creating full representations of Mozilla telemetry pings.☆11Updated this week
- Automatically loads new partitions in AWS Athena☆18Updated 4 years ago
- A starter project to create Arc jobs using the Jupyter Notebook interface☆22Updated 4 years ago
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Terraform script for launching multiple EMR clusters for training purposes.☆16Updated last year
- This tool generates emulated data stream based on the NYC Taxi & Limousine Commission’s open dataset expanded with additional routing inf…☆13Updated 6 years ago
- Helps produce, consume, and collaborate on CloudEvents easier.☆21Updated 5 years ago
- A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.☆39Updated 5 years ago
- Content for the Athena Guide (https://athena.guide)☆10Updated 4 months ago
- Public presentations given by the Aiven staff☆14Updated 4 years ago
- A few end to end examples that use data-describe☆16Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Events about the open source data stack☆13Updated 2 years ago
- Outdated repository, moved to https://git.abolivier.bzh/babolivier/grafana-dashboards-manager☆16Updated 6 years ago
- Airflow code accompanying blog post.☆21Updated 6 years ago
- Generalized project for running Airflow DAGs, with possibility of skipping tasks already done for some set of input parameters.☆15Updated 2 years ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 8 years ago
- Deployment tools/scripts for Metaflow!☆56Updated last year
- A CLI tool to perform migrations on BigQuery tables☆11Updated 3 years ago
- Terraform module to create AWS ECR (Elastic Container Registry)☆11Updated last week
- An opinionated Kafka producer/consumer built on top of confluent-kafka-python/librdkafka☆27Updated last month
- ☆29Updated last year
- Log Management with Graylog, Elasticsearch, MongoDB, Nginx, Fluentd and Docker☆12Updated 6 years ago