anindya-saha / Data-Science-with-Spark
Machine Learning and Data Analysis Case Studies using Spark.
☆72Updated 4 years ago
Alternatives and similar repositories for Data-Science-with-Spark:
Users that are interested in Data-Science-with-Spark are comparing it to the libraries listed below
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- a curated list of R tutorials for Data Science, NLP and Machine Learning☆23Updated 8 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- Program assignments for the Deep Learning Specialization at Coursera by Andrew Ng☆51Updated 7 years ago
- Project work for Udacity's AB Testing Course☆82Updated 7 years ago
- This repository contains Spark, MLlib, PySpark and Dataframes projects☆45Updated 7 years ago
- Codes written for some competitions☆13Updated 8 years ago
- Simple sentiment analysis model with PySpark☆43Updated 7 years ago
- PyCon SG 2016 - Customer Segmentation in Python☆56Updated 8 years ago
- Learn Machine Learning using PySpark from scratch☆19Updated 6 years ago
- ☆63Updated 6 years ago
- Lending Club Loan data analysis☆164Updated 5 years ago
- Getting start with PySpark and MLlib☆297Updated 6 years ago
- L&T Financial Services & Analytics Vidhya presents ‘DataScience FinHack’ organised by Analytics Vidhya☆54Updated 5 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :).☆150Updated 7 years ago
- This is the presentation on - What are the key points one should consider if they will be appearing in Data Science job interview☆40Updated 6 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆115Updated 8 months ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- Data analysis using numpy, pandas, matplotlib, seaborn, sqlite3, data wrangling☆31Updated 5 years ago
- Generic codes related to NLP☆85Updated 6 years ago
- Course on Udemy by Jose Portilla☆99Updated 7 years ago
- Data science blog☆33Updated 6 years ago
- Codes used for the hack session in DHS 2019☆53Updated 5 years ago
- ☆21Updated 6 years ago
- Repo will try to cover all the most frequently used ML algos with proper explanation and examples☆10Updated 6 years ago
- ☆77Updated 8 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- Solutions to the book "Collection of Data Science TakeHome Challenges" in Python.☆10Updated 7 years ago
- Codes related to various ML Hackathons☆213Updated 5 years ago
- Notes from different sources such as Harvard CS109 course, Springboard's Data Science Interview questions, Elements of Programming Interv…☆35Updated 4 years ago