The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
☆28Jun 13, 2022Updated 3 years ago
Alternatives and similar repositories for pyspark-on-aws-emr
Users that are interested in pyspark-on-aws-emr are comparing it to the libraries listed below
Sorting:
- Dockerizing an Apache Spark Standalone Cluster☆42Jun 29, 2022Updated 3 years ago
- A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler…☆13Jun 29, 2022Updated 3 years ago
- Challenge Data Engineer☆25Jun 13, 2022Updated 3 years ago
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)☆13Jun 13, 2022Updated 3 years ago
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆20Aug 5, 2022Updated 3 years ago
- Searching and Sorting Algorithms☆19Feb 27, 2026Updated last week
- PySpark Projects☆27Feb 3, 2026Updated last month
- Scripts to install Windows 11 25H2 on unsupported PCs (Fast/Advanced/Reset).☆34Oct 7, 2025Updated 5 months ago
- ITCS 6190 : Cloud Computing for Data Analysis project. Movie Recommendation Engine for Netflix Data with custom functions implementation …☆30Dec 8, 2017Updated 8 years ago
- ☆10Jun 21, 2021Updated 4 years ago
- Data sets and ML models versioning example from DVC get started☆10Jun 4, 2024Updated last year
- Learn various Algorithms of Machine Learning like SVC, Decision Tree , Random Forest , Logistic Regression, Linear Regression and much Mo…☆11Jul 31, 2019Updated 6 years ago
- Natural Language Processing☆11Jun 23, 2021Updated 4 years ago
- PredictorFinc is a scalable supervised machine learning model the predicts stock price change through Decision Tree Regressor using data …☆12Sep 5, 2023Updated 2 years ago
- coreless esp32 controlled drone☆10Mar 17, 2023Updated 2 years ago
- ☆14Sep 14, 2021Updated 4 years ago
- A web based battery monitoring system for SP15S020 BMS☆12Jun 24, 2020Updated 5 years ago
- ☆11Jun 11, 2021Updated 4 years ago
- This is the end to end MLOps project I built through participated the MLOps Zoomcamp☆10Sep 11, 2022Updated 3 years ago
- Learn how to combine Nginx + wigs + load balancing + flask + unit testing + Docker☆12Jun 2, 2021Updated 4 years ago
- ☆10Aug 12, 2024Updated last year
- ☆10Dec 9, 2024Updated last year
- A simple sign language recognizer using SVM☆11Jun 21, 2022Updated 3 years ago
- https://liyasthomas.com☆16Jan 21, 2022Updated 4 years ago
- Anaconda plugin for StarCluster☆21Aug 14, 2024Updated last year
- GyverLamp ESP8266 Firmware☆10Feb 20, 2022Updated 4 years ago
- A scraper made using beautiful soup 4 in python. Tailor made for extracting news from moneycontrol.com. Issue pull request for different …☆12Jun 21, 2020Updated 5 years ago
- CH341A I2C MStar programmer☆15Apr 18, 2024Updated last year
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- Fundraiser Tracker implemented as AWS Lambda with ability to manage through Slack and autosync with Monobank and Privatbank☆10Apr 24, 2025Updated 10 months ago
- A GitBook about creating a GitBook for teaching☆10Apr 21, 2020Updated 5 years ago
- Movie Reviews Sentiment Analysis☆12Jun 28, 2018Updated 7 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 4 years ago
- Python package for parsing log lines in the logfmt style.☆20Nov 9, 2018Updated 7 years ago
- Tutorial about discovering and exploring hidden web APIs☆10Mar 13, 2019Updated 6 years ago
- ☆40Mar 2, 2026Updated last week
- ☆10Apr 25, 2021Updated 4 years ago
- Exploratory Data Analysis and Data Visualisation of All Space Missions from 1957 Dataset.☆12Jun 15, 2021Updated 4 years ago