InsightDataScience / systems-puzzle
Systems Puzzle for the Insight DevOps Engineering program
☆6Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for systems-puzzle
- Collection of Terraform and Ansible scripts for easy AWS operations☆14Updated 4 years ago
- Quickstart PySpark with Anaconda on AWS/EMR☆53Updated 7 years ago
- ☆26Updated 10 months ago
- Use Kubernetes to autoscale your spark clusters.☆10Updated 5 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆61Updated last year
- ☆53Updated 7 years ago
- AWS Big Data Certification☆25Updated last year
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 6 years ago
- ☆13Updated 4 years ago
- Example of using Airflow to schedule downloading data form S3 and launching spark jobs☆15Updated 8 years ago
- Easy scaffolding for machine learning pipelines in Scikit-Learn☆7Updated 5 years ago
- ☆14Updated 3 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 11 months ago
- Supporting code, Dockerfile, and Jupyter notebook for an end to end tutorial on Amazon SageMaker and EMR.☆28Updated 5 years ago
- The Open Source resources in Data Engineering, Machine Learning, Data Science areas, inspired by [The Open-Source Data Science Masters] (…☆8Updated 7 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 8 years ago
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War…☆25Updated 7 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆25Updated 4 years ago
- Essay on Amazon Redshift☆61Updated 6 years ago
- Cloudformation templates for deploying Airflow in ECS☆40Updated 5 years ago
- All the code related to building my own data lake☆22Updated last year
- Docker image for a Python installation with Spark, Hadoop and Sqoop binaries☆15Updated 6 years ago
- PREVIEW - Run Bonobo data processing graphs in docker containers.☆13Updated last year
- ☆13Updated 4 years ago
- Teaching materials for the Convolutional Neural Networks for Visual Recognition (http://cs231n.github.io/python-numpy-tutorial/) classes …☆24Updated 5 years ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz…☆28Updated 5 years ago
- AWS Quick Start Team☆23Updated last month
- Directions and Source code for Insight's Docker workshop.☆22Updated 2 years ago