nicodv / pyspark-tutorial
A short tutorial notebook on PySpark
☆15Updated 9 years ago
Alternatives and similar repositories for pyspark-tutorial:
Users that are interested in pyspark-tutorial are comparing it to the libraries listed below
- Slides and materials for most of my talks by year☆91Updated last year
- Material for UW Extension Data Science 350☆19Updated 7 years ago
- Repository for the PyData DC 2016 tutorial☆29Updated 8 years ago
- Competition repository☆21Updated 5 years ago
- General Assembly repo for Data Science 18☆36Updated 9 years ago
- Extracting LinkedIn comments from any post and export it to Excel file☆23Updated 6 years ago
- Metis Data Science Portfolio - Summer 2017☆26Updated 6 years ago
- This library is a wrapper for sklearn and works with data stored using Pandas module.☆17Updated 8 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 8 years ago
- Containing codes of participation in Kaggle competitions.☆37Updated 8 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark☆27Updated 6 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- Springboard - Data Science Intensive course☆13Updated 7 years ago
- PyCon 2017 tutorial on time series analysis☆72Updated 7 years ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition☆55Updated 9 years ago
- Codes related to Knocktober 2016☆23Updated 8 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 6 years ago
- Detailed notes and codes on learning pandas quickly for machine learning.☆26Updated 8 years ago
- Winning solution for Analytics Vidhya Mini Hack☆21Updated 7 years ago
- 32/2384 Solution to Kaggle Mercari Competition (solo silver medal winner)☆20Updated 6 years ago
- Tutorial on deploying machine learning models to production☆58Updated 5 years ago
- RESTful API hosting xgboost model☆24Updated 7 years ago
- Jupyter Notebooks for Strata Data Conference NY 2017 Deep Learning for Recommender Systems Tutorial☆22Updated 7 years ago
- Notes for Data Science 350 Class☆23Updated 7 years ago
- Presentation on How to use Facebook Prophet☆8Updated 7 years ago
- Workshop: Python for Data Science☆61Updated 10 years ago
- Code for determining optimal number of clusters for K-means algorithm using the 'elbow criterion'☆41Updated last year
- ML Nanodegree Capstone Project - Predicting NYC Taxi Trip Duration☆12Updated 7 years ago
- The Smart Recruit hackathon on AnalyticsVidhya☆17Updated 8 years ago
- ☆26Updated last year