nicodv / pyspark-tutorial
A short tutorial notebook on PySpark
☆15Updated 9 years ago
Alternatives and similar repositories for pyspark-tutorial:
Users that are interested in pyspark-tutorial are comparing it to the libraries listed below
- Slides and materials for most of my talks by year☆92Updated last year
- NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-lear…☆30Updated 6 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 6 years ago
- Tutorial on deploying machine learning models to production☆59Updated 5 years ago
- Pydata Dallas 2015 Scikit-Learn Tutorial☆62Updated 9 years ago
- All Kaggle competitions☆91Updated 8 years ago
- Containing codes of participation in Kaggle competitions.☆37Updated 9 years ago
- Springboard - Data Science Intensive course☆13Updated 8 years ago
- Material for UW Extension Data Science 350☆19Updated 7 years ago
- A machine learning algorithm written to predict severity of insurance claim☆20Updated 8 years ago
- Predicting happiness from demographics and poll answers☆45Updated 8 years ago
- Code for determining optimal number of clusters for K-means algorithm using the 'elbow criterion'☆41Updated last week
- This library is a wrapper for sklearn and works with data stored using Pandas module.☆17Updated 9 years ago
- ☆28Updated 6 years ago
- Slides, code and more for my class: Data Analytics and Machine Learning on Big Data☆8Updated 7 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- ☆31Updated 9 years ago
- Codes related to Knocktober 2016☆23Updated 8 years ago
- Detailed notes and codes on learning pandas quickly for machine learning.☆26Updated 8 years ago
- Analysis of NYC Green Taxi and a model to predict the tip as a percentage of the total fare☆45Updated 7 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- A compiled list of kaggle competitions and their winning solutions for sequence problems.☆35Updated 8 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- The Smart Recruit hackathon on AnalyticsVidhya☆17Updated 8 years ago
- ☆77Updated 8 years ago
- ☆66Updated last year
- Files for London PyData London, 2015☆15Updated 9 years ago
- Some small utility modules to help with pandas, numpy and sklearn usage in other projects☆22Updated 9 years ago
- A Tour of Time Series Analysis☆23Updated 8 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago