anindya-saha / Data-Science-with-Spark
Machine Learning and Data Analysis Case Studies using Spark.
☆72 · Updated 4 years ago
Alternatives and similar repositories for Data-Science-with-Spark
Users who are interested in Data-Science-with-Spark are comparing it to the repositories listed below
- Getting started with PySpark and MLlib ☆300 · Updated 7 years ago
- Project work for Udacity's AB Testing Course ☆83 · Updated 8 years ago
- PySpark Code for Hands-on Learners ☆117 · Updated 6 years ago
- Program assignments for the Deep Learning Specialization at Coursera by Andrew Ng ☆51 · Updated 8 years ago
- Codes related to various ML Hackathons ☆211 · Updated 6 years ago
- Notes on Apache Spark (pyspark) ☆297 · Updated 6 years ago
- Code snippets and tutorials for working with social science data in PySpark ☆421 · Updated 8 years ago
- Lending Club Loan data analysis ☆167 · Updated 6 years ago
- Churn Prediction with PySpark using MLlib and ML Packages ☆58 · Updated 9 years ago
- PyCon SG 2016 - Customer Segmentation in Python ☆56 · Updated 9 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :). ☆156 · Updated 8 years ago
- LearningApacheSpark ☆250 · Updated 2 years ago
- A general-purpose framework for solving problems with machine learning applied to predicting customer churn ☆423 · Updated last year
- Source Code for 'Machine Learning with PySpark' by Pramod Singh ☆117 · Updated 6 years ago
- L&T Financial Services & Analytics Vidhya presents ‘DataScience FinHack’ organised by Analytics Vidhya ☆55 · Updated 6 years ago
- Solutions to the book "Collection of Data Science TakeHome Challenges" in Python. ☆10 · Updated 8 years ago
- Learn Machine Learning using PySpark from scratch ☆20 · Updated 7 years ago
- A compiled list of kaggle competitions and their winning solutions for regression problems. ☆149 · Updated 9 years ago
- Simple sentiment analysis model with PySpark ☆42 · Updated 7 years ago
- Apache Spark (PySpark) Practice on Real Data ☆273 · Updated 6 years ago
- This is the presentation on - What are the key points one should consider if they will be appearing in Data Science job interview ☆40 · Updated 7 years ago
- Code repository for Learning PySpark by Packt ☆340 · Updated 3 years ago
- This repository contains Spark, MLlib, PySpark and Dataframes projects ☆49 · Updated 8 years ago
- A curated list of repositories for my book Machine Learning Solutions. ☆81 · Updated 7 years ago
- Generic codes related to NLP ☆85 · Updated 7 years ago
- Tips for Advanced Feature Engineering ☆53 · Updated 5 years ago
- Data sets and scripts for Coursera Big Data Specialization. ☆172 · Updated last year
- Codes, notes and guides on Udacity's machine learning nanodegree. ☆82 · Updated 9 years ago
- Updated repository ☆157 · Updated 4 years ago
- a curated list of R tutorials for Data Science, NLP and Machine Learning ☆23 · Updated 9 years ago