nicodv / pyspark-tutorial
A short tutorial notebook on PySpark
☆15 · Updated 9 years ago
Alternatives and similar repositories for pyspark-tutorial:
Users who are interested in pyspark-tutorial are comparing it to the repositories listed below:
- PyTennessee 2014: Statistical Data Analysis in Python ☆85 · Updated 10 years ago
- Material for UW Extension Data Science 350 ☆19 · Updated 7 years ago
- All Kaggle competitions ☆91 · Updated 8 years ago
- Slides and materials for most of my talks by year ☆92 · Updated last year
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator ☆30 · Updated 6 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR. ☆44 · Updated 8 years ago
- Pydata Dallas 2015 Scikit-Learn Tutorial ☆62 · Updated 9 years ago
- Code for the Kaggle acquire valued shoppers challenge ☆66 · Updated 10 years ago
- General Assembly repo for Data Science 18 ☆36 · Updated 9 years ago
- Material for Machine Learning Meetup "Machine Learning with Scikit-learn" ☆29 · Updated 9 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach… ☆115 · Updated 8 months ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition ☆55 · Updated 10 years ago
- Kaggle competition results ☆20 · Updated 6 years ago
- feng - feature engineering for machine-learning champions ☆27 · Updated 8 years ago
- Containing codes of participation in Kaggle competitions. ☆37 · Updated 9 years ago
- A collection of statistical tools to aid Data Science competitors in Kaggle Competitions. ☆63 · Updated 7 years ago
- Springboard - Data Science Intensive course ☆13 · Updated 8 years ago
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara ☆17 · Updated 8 years ago
- Tutorial on deploying machine learning models to production ☆59 · Updated 5 years ago
- Code for determining optimal number of clusters for K-means algorithm using the 'elbow criterion' ☆41 · Updated last month
- Material and slides for Boston NLP meetup May 23rd 2016 ☆17 · Updated 8 years ago
- A Tour of Time Series Analysis ☆23 · Updated 8 years ago
- 32/2384 Solution to Kaggle Mercari Competition (solo silver medal winner) ☆21 · Updated 7 years ago
- PyCon 2017 tutorial on time series analysis ☆72 · Updated 7 years ago
- Tutorial Created for SciPy 2012 ☆58 · Updated 11 years ago
- Predicting happiness from demographics and poll answers ☆45 · Updated 8 years ago
- Repository for the PyData DC 2016 tutorial ☆29 · Updated 8 years ago
- Scikit-learn quickstart tutorial for Webstep ☆19 · Updated 7 years ago
- ML Nanodegree Capstone Project - Predicting NYC Taxi Trip Duration ☆12 · Updated 7 years ago
- Detailed notes and codes on learning pandas quickly for machine learning. ☆26 · Updated 8 years ago