garystafford / emr-demoView external linksLinks
Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
☆38Sep 1, 2022Updated 3 years ago
Alternatives and similar repositories for emr-demo
Users that are interested in emr-demo are comparing it to the libraries listed below
Sorting:
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A…☆41Jul 6, 2022Updated 3 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 3 months ago
- The proposed solution shows and approach to unify and centralize logs across different compute platforms like EC2, ECS, EKS and Lambda wi…☆14Oct 17, 2023Updated 2 years ago
- Amazon EMR Serverless and Amazon MSK Serverless Demo☆13Jul 31, 2022Updated 3 years ago
- Python script to automatically sync new instances via AWS CodeDeploy APIs☆16Jan 14, 2026Updated last month
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'☆29Dec 22, 2020Updated 5 years ago
- ☆17Oct 15, 2020Updated 5 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database☆14Oct 26, 2021Updated 4 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆48Jan 7, 2025Updated last year
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆18Sep 17, 2018Updated 7 years ago
- This reference architecture demonstrates the use of AWS Step Functions to orchestrate an Extract Transfer Load (ETL) workflow with AWS La…☆24Jun 16, 2020Updated 5 years ago
- AWS Glue tutorial for data developers.☆23Sep 2, 2019Updated 6 years ago
- ☆27Dec 17, 2020Updated 5 years ago
- Supporting code, Dockerfile, and Jupyter notebook for an end to end tutorial on Amazon SageMaker and EMR.☆28Jan 14, 2026Updated last month
- ☆27Dec 8, 2022Updated 3 years ago
- A browser for open-data-registry: https://github.com/awslabs/open-data-registry☆40Updated this week
- ☆12Sep 23, 2025Updated 4 months ago
- ☆11Jun 12, 2023Updated 2 years ago
- An AWS based solution using AWS CloudWatch and AWS Lambda based on Python to automatically terminate AWS EMR clusters that have been idle…☆26Jun 5, 2024Updated last year
- Women Who Code stuff☆12Dec 10, 2019Updated 6 years ago
- ☆34Dec 12, 2022Updated 3 years ago
- Follow this "Urban Roast" demo tutorial series to power your app with Visualize.js!☆11Oct 10, 2023Updated 2 years ago
- Prototype Pandemic Unemployment Assistance (PUA) claim service☆12Dec 2, 2021Updated 4 years ago
- Ansible playbooks for Linux-HA Japan Pacemaker repository package.☆10Jul 13, 2020Updated 5 years ago
- ☆10Mar 31, 2025Updated 10 months ago
- ☆14Sep 14, 2021Updated 4 years ago
- PredictorFinc is a scalable supervised machine learning model the predicts stock price change through Decision Tree Regressor using data …☆12Sep 5, 2023Updated 2 years ago
- Event-driven, Serverless Architectures with AWS Lambda, SQS, DynamoDB, and API Gateway☆35Jul 15, 2021Updated 4 years ago
- Reference implementation on labeling video frames using Amazon Rekognition. The repo also contains some OpenCV based video utilities for …☆41Jan 14, 2026Updated last month
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Oct 18, 2020Updated 5 years ago
- A Python library to simplify batch requests to AWS Services☆12Apr 25, 2020Updated 5 years ago
- Odoo project related addons☆12Dec 15, 2020Updated 5 years ago
- My applied big data analytic project with pyspark.☆10Sep 21, 2022Updated 3 years ago
- ☆11Oct 30, 2024Updated last year
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- Power Plant ML Pipeline Application - Apache Spark☆12Dec 12, 2016Updated 9 years ago
- ☆15Jul 2, 2024Updated last year
- ☆10May 16, 2022Updated 3 years ago
- This repository contains code written in the AWS Cloud Development Kit (CDK) which launches infrastructure across two different regions t…☆12Mar 10, 2022Updated 3 years ago