fscm / terraform-module-aws-sparkLinks
Terraform Module to create a Apache Spark cluster on AWS
☆16Updated 3 years ago
Alternatives and similar repositories for terraform-module-aws-spark
Users that are interested in terraform-module-aws-spark are comparing it to the libraries listed below
Sorting:
- T4 is now in production as Quilt 3☆64Updated 6 years ago
- CLI tool to launch Spark jobs on AWS EMR☆67Updated last year
- Small Docker image with Python Machine Learning tools (~180MB) https://hub.docker.com/r/frolvlad/alpine-python-machinelearning/☆81Updated 4 months ago
- A solution enabling customers to quickly deploy an architecture to identify and mask sensitive health data☆26Updated 2 years ago
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 7 years ago
- An ML project template with sensible defaults☆37Updated 3 years ago
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes.☆13Updated 3 years ago
- 📚 The Bayes Way 🎓☆33Updated 2 years ago
- Deployment tools/scripts for Metaflow!☆56Updated 2 years ago
- The blog post about Kubeflow, including all materials☆31Updated 2 months ago
- Query GitHub API v4 using GraphQL☆15Updated 7 years ago
- ☆15Updated 7 years ago
- Quickly compare changes made to Jupyter notebooks in GitHub repositories with jupydiff!☆13Updated 2 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated 8 months ago
- Open source Flotilla☆195Updated 2 weeks ago
- All the code related to building my own data lake☆21Updated 2 years ago
- Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/cour…☆46Updated last year
- ☕⛵WIP PySpark dependency management☆22Updated 7 years ago
- Primrose modeling framework for simple production models☆32Updated last year
- Create an nteractive application with zero configuration☆36Updated last year
- Know your ML Score based on Sculley's paper☆34Updated 6 years ago
- Repository for makeinga a GitHub Actions for deploying to Kubeflow.☆35Updated 3 years ago
- ☆30Updated last year
- Content for the Athena Guide (https://athena.guide)☆11Updated 9 months ago
- Functional Airflow DAG definitions.☆38Updated 8 years ago
- ☆59Updated 3 years ago
- Unit and integration testing with PySpark can be tough to figure out, let's make that easier.☆23Updated 9 years ago
- Instructions for deploying Kubeflow on EKS and minikube☆15Updated 4 years ago
- Data science tool for creating and deploying pipelines with versioned data☆45Updated last year
- The stupidest database of all time.☆55Updated last week