fscm / terraform-module-aws-sparkLinks
Terraform Module to create a Apache Spark cluster on AWS
☆16Updated 3 years ago
Alternatives and similar repositories for terraform-module-aws-spark
Users that are interested in terraform-module-aws-spark are comparing it to the libraries listed below
Sorting:
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 7 years ago
- T4 is now in production as Quilt 3☆64Updated 6 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated 9 months ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- An ML project template with sensible defaults☆37Updated 3 years ago
- A Terraform module to deploy and run YugabyteDB on AWS.☆21Updated last month
- Generalized project for running Airflow DAGs, with possibility of skipping tasks already done for some set of input parameters.☆16Updated 2 years ago
- GitHub Action That Submits Argo Workflows For Execution on Your GKE Cluster☆16Updated 4 years ago
- Instructions for deploying Kubeflow on EKS and minikube☆15Updated 4 years ago
- Repository for makeinga a GitHub Actions for deploying to Kubeflow.☆35Updated 3 years ago
- All the code related to building my own data lake☆21Updated 2 years ago
- A Cloud Native Query Engine. Serverless, if it fits your case.☆54Updated 2 years ago
- pytest support for airflow☆12Updated 4 years ago
- The blog post about Kubeflow, including all materials☆31Updated 3 months ago
- AWS Quick Start Team☆23Updated 11 months ago
- Ansible role to deploy and configure Airflow☆41Updated 2 weeks ago
- Content for the Athena Guide (https://athena.guide)☆11Updated 10 months ago
- ☆31Updated last year
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- A Github API client to extract events and actions, and load into a database☆28Updated 3 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆85Updated 2 years ago
- A Delta Lake reader for Dask☆53Updated last month
- Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/cour…☆46Updated last year
- A solution enabling customers to quickly deploy an architecture to identify and mask sensitive health data☆26Updated 2 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- The stupidest database of all time.☆56Updated last month
- Functional Airflow DAG definitions.☆38Updated 8 years ago
- Deployment tools/scripts for Metaflow!☆56Updated 2 years ago
- 🚀 Deploy Kubeflow on AWS EKS with Terraform 🤖☆65Updated 2 years ago