anindya-saha / Data-Science-with-Spark
Machine Learning and Data Analysis Case Studies using Spark.
☆72Updated 3 years ago
Alternatives and similar repositories for Data-Science-with-Spark:
Users that are interested in Data-Science-with-Spark are comparing it to the libraries listed below
- Project work for Udacity's AB Testing Course☆82Updated 7 years ago
- PyCon SG 2016 - Customer Segmentation in Python☆56Updated 8 years ago
- Solutions to the book "Collection of Data Science TakeHome Challenges" in Python.☆10Updated 7 years ago
- Customer life time analysis (CLV analysis). We are using Gamma-Gamma model to estimate average transaction value for each customer.☆45Updated 6 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- ☆14Updated 7 years ago
- a curated list of R tutorials for Data Science, NLP and Machine Learning☆23Updated 8 years ago
- ☆77Updated 8 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆115Updated 7 months ago
- Program assignments for the Deep Learning Specialization at Coursera by Andrew Ng☆51Updated 7 years ago
- Codes written for some competitions☆13Updated 8 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :).☆149Updated 7 years ago
- ☆57Updated 6 years ago
- Data science blog☆33Updated 6 years ago
- This repository contains Spark, MLlib, PySpark and Dataframes projects☆44Updated 7 years ago
- Lending Club Loan data analysis☆163Updated 5 years ago
- Data Science Take Home Challenges☆12Updated 6 years ago
- This is the presentation on - What are the key points one should consider if they will be appearing in Data Science job interview☆40Updated 6 years ago
- Getting start with PySpark and MLlib☆297Updated 6 years ago
- Simple sentiment analysis model with PySpark☆42Updated 7 years ago
- Codes, notes and guides on Udacity's machine learning nanodegree.☆83Updated 8 years ago
- ☆29Updated 6 years ago
- A curated list of awesome customer analytics content☆95Updated 7 years ago
- Build a flask app to server a machine learning model as a RESTful web service☆38Updated 7 years ago
- Unsupervised Clustering on Online Retail Dataset☆31Updated 5 years ago
- Projects submitted as part of working through udacity's data engineering nanodegree.☆9Updated 4 years ago
- My Solutions to "A Collection of Data Science Take-Home Challenges" by Giulio Palombo.☆78Updated 5 years ago
- PyCon 2017 tutorial on time series analysis☆72Updated 7 years ago