MHassaanButt / Flight-Delays-Prediction
In this project, I used Decision Tree Learning Model as the main algorithm to build the model. Due to the big amount of flight data, we implement the project using MRJob, PySpark and Spark's MLlib then compare the performance and accuracy of those implementations.
☆11Updated 3 years ago
Alternatives and similar repositories for Flight-Delays-Prediction:
Users that are interested in Flight-Delays-Prediction are comparing it to the libraries listed below
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆29Updated 4 years ago
- Use Multiple Linear Regression, Python, Pandas, and Matplotlib to analyze the lifetime value and the key factors of the ‘Telco Customer C…☆10Updated 4 years ago
- ☆16Updated 2 years ago
- Develop ML models predict taxi trip duration in NYC. Ranked : Top 6% | RMSLE : 0.377 (Kaggle) | #DS☆17Updated 2 years ago
- My applied big data analytic project with pyspark.☆10Updated 2 years ago
- This project aims to leverage Amazon Web Services to create trending Youtube videos analytics service. Project contains different data en…☆17Updated this week
- Kubeflow installation on windows 10/11☆17Updated 2 years ago
- Build ML model with meaningful variables. Use model for predictions☆18Updated 2 years ago
- The goal of this project is to build an unsupervised machine learning model that predicts customers' next purchase date.☆19Updated 3 years ago
- PySpark Projects☆23Updated this week
- Some of my sql projects with sqlite.☆10Updated 3 years ago
- ☆14Updated 3 years ago
- A small repository explaining how you can validate your linear regression model based on assumptions☆13Updated 3 years ago
- Customer-base segmentation over e-commerce sales data☆26Updated 5 years ago
- Turning salesforce lead, oppty, & sales activities data => Sales predictions using pandas, Scikit-learn, SQLAlchemy, Redshift, XGBoost Cl…☆27Updated 4 years ago
- Solved end-to-end machine learning projects☆32Updated last year
- pyspark dataframe made easy☆16Updated 3 years ago
- PredictorFinc is a scalable supervised machine learning model the predicts stock price change through Decision Tree Regressor using data …☆12Updated last year
- This repository consist of a 50-day program. All the statistics required for the complete understanding of data science will be uploaded …☆29Updated 3 years ago
- This is a guided certification project, as a part of Data Science for Social Good initiative☆17Updated 5 years ago
- Dataset accompanying the paper titled "Pothole detection and dimension estimation system using deep learning (YOLO) and image processing"☆10Updated 2 years ago
- Computer Vision Papers of the week☆17Updated 2 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆27Updated 2 years ago
- ☆11Updated 4 years ago
- Insurance Claim Prediction using Machine Learning - Udacity Nanodegree Capstone Project☆16Updated 8 years ago
- Deploying Models to Production with Mlflow and AWS Sagemaker☆22Updated 3 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆15Updated 6 years ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Updated last year
- Data Analysis with Python - Customer Segmentation ( RFM Analysis) - Power BI Dashboard - Tableau Dashboard☆10Updated 4 years ago
- Portofolio repository for Udacity Data Scientist Nanodegree☆41Updated 4 years ago