ernest-kiwele / chicago-crime-analysis-apache-spark
Using Apache Spark SQL, Spark ML, Pandas to analyse and predict using the Chicago crime dataset
☆11Updated 7 years ago
Alternatives and similar repositories for chicago-crime-analysis-apache-spark
Users that are interested in chicago-crime-analysis-apache-spark are comparing it to the libraries listed below
Sorting:
- Exploratory data analysis using Python, Numpy, Pandas, Seaborn for Carsale Advertisement Dataset☆8Updated 5 years ago
- Detecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Fraud…☆82Updated 5 years ago
- Portofolio repository for Udacity Data Scientist Nanodegree☆41Updated 4 years ago
- Book Projects☆24Updated 4 years ago
- ☆21Updated 6 years ago
- Introduction and Career Guide for Data Science enthusiasts☆9Updated 6 years ago
- data science interview questions company wise which include the data analyst , junior data scientist , machine learning engineer etc. pos…☆15Updated 3 years ago
- ☆63Updated 6 years ago
- Data analysis using numpy, pandas, matplotlib, seaborn, sqlite3, data wrangling☆31Updated 5 years ago
- Machine Learning Case study on customer segmentation and prediction of groups.☆31Updated 6 years ago
- {PySpark, R, Python}: Several Data Science projects☆15Updated 7 years ago
- Learning Machine Learning and showcasing my work for 100 Days.☆16Updated 6 years ago
- Udacity Data Science Nanodegree Repository. Contains lecture notes, and dummy scripts as well as projects undertaken for the nanodegree.☆30Updated 5 years ago
- Applying machine learning to predict loan charge-offs on LendingClub.com☆42Updated 6 years ago
- In this Data set we are Predicting the Insurance Claim by each user, Machine Learning algorithms for Regression analysis are used and Dat…☆39Updated 6 years ago
- Introduction In ecommerce companies like online retails, customer segmentation is necessary in order to understand customers behaviors. I…☆10Updated 5 years ago
- ☆17Updated 5 years ago
- Project - Data Processing and Analysis in Python Course☆41Updated 6 years ago
- This repository contains a collection of all the capstone projects made for the Data Analyst Nanodegree Certification - Udacity☆11Updated 6 years ago
- Credit Card Fraud Detection using ML: IEEE style paper + Jupyter Notebook☆104Updated 2 years ago
- My Graduate Capstone Project - This is a Product Recommendation System for a Local Wholesaler in India, using Python and Machine Learning…☆28Updated 4 years ago
- This repo contains the material and projects for Udacity Data science Nanodegree term 2☆12Updated 2 years ago
- My Experiments with Time Series☆25Updated 3 years ago
- Contains code and presentation for my interactive hack session, 'Effective Feature Engineering: A Structured Approach to Building Better …☆30Updated 4 years ago
- Predictive Analytics for Busines co-created by Alteryx and Tableau☆13Updated 8 years ago
- Detailed notes and code to learn the basics of machine learning with scikit-learn.☆35Updated 8 years ago
- It's a Git Repo containing source code, supported docker files, multiple linear regression pickle file and other related contents of Flas…☆27Updated 2 years ago
- Small example on how you can detect multicollinearity☆13Updated 3 years ago
- Analysing the content of an E-commerce database that contains list of purchases. Based on the analysis, I develop a model that allows to …☆134Updated 7 years ago
- ☆18Updated 7 years ago