anindya-saha / Data-Science-with-SparkLinks
Machine Learning and Data Analysis Case Studies using Spark.
☆72Updated 4 years ago
Alternatives and similar repositories for Data-Science-with-Spark
Users that are interested in Data-Science-with-Spark are comparing it to the libraries listed below
Sorting:
- Project work for Udacity's AB Testing Course☆83Updated 8 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- Getting start with PySpark and MLlib☆298Updated 7 years ago
- This is the presentation on - What are the key points one should consider if they will be appearing in Data Science job interview☆40Updated 6 years ago
- A compiled list of kaggle competitions and their winning solutions for regression problems.☆147Updated 8 years ago
- Lending Club Loan data analysis☆165Updated 6 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆57Updated 9 years ago
- Codes related to various ML Hackathons☆213Updated 6 years ago
- Codes written for some competitions☆13Updated 8 years ago
- Simple sentiment analysis model with PySpark☆43Updated 7 years ago
- A repository of machine learning codes written for re-usability☆143Updated 5 years ago
- Program assignments for the Deep Learning Specialization at Coursera by Andrew Ng☆51Updated 7 years ago
- Python script to pull machine learning flashcards from Chris Albon's twitter feed☆81Updated 6 years ago
- Code snippets and tutorials for working with social science data in PySpark☆421Updated 7 years ago
- Code repository for Learning PySpark by Packt☆332Updated 2 years ago
- A general-purpose framework for solving problems with machine learning applied to predicting customer churn☆413Updated last year
- L&T Financial Services & Analytics Vidhya presents ‘DataScience FinHack’ organised by Analytics Vidhya☆54Updated 6 years ago
- Notes on Apache Spark (pyspark)☆298Updated 6 years ago
- Updated repository☆157Updated 3 years ago
- Jupyter notebooks for pyspark tutorials given at University☆108Updated 7 months ago
- Apache Spark (PySpark) Practice on Real Data☆274Updated 5 years ago
- LearningApacheSpark☆244Updated last year
- ☆101Updated 7 years ago
- Notes from different sources such as Harvard CS109 course, Springboard's Data Science Interview questions, Elements of Programming Interv…☆35Updated 4 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆123Updated 2 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :).☆153Updated 7 years ago
- Interview stuff for friends☆84Updated 2 years ago
- Notes on Flask REST API and tutorial☆148Updated 6 years ago
- A curated list of repositories for my book Machine Learning Solutions.☆78Updated 7 years ago
- Data sets and scripts for Coursera Big Data Specialization.☆167Updated last year