tdhopper / rta-pyspark-presentation
Very basic introduction to pyspark
☆15 Updated 7 years ago
Alternatives and similar repositories for rta-pyspark-presentation:
Users who are interested in rta-pyspark-presentation are comparing it to the repositories listed below.
- Material and slides for Boston NLP meetup May 23rd 2016 ☆17 Updated 8 years ago
- Introduction to structured prediction with Python and pystruct ☆18 Updated 6 years ago
- Slides and materials for most of my talks by year ☆92 Updated last year
- Notes for Data Science 350 Class ☆24 Updated 7 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit ☆24 Updated 10 years ago
- Springboard - Data Science Intensive course ☆13 Updated 7 years ago
- Course of Machine Learning in Science and Industry at Heidelberg university ☆47 Updated 7 years ago
- Codes related to Knocktober 2016 ☆23 Updated 8 years ago
- This library is a wrapper for sklearn and works with data stored using Pandas module. ☆17 Updated 8 years ago
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara ☆17 Updated 8 years ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition ☆55 Updated 9 years ago
- Collection of presentation of my work on various platforms and meetups ☆22 Updated 5 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator ☆30 Updated 6 years ago
- Companion code for my video course on Practical Python Data Science Techniques, published by Packt Publishing ☆33 Updated 7 years ago
- 32/2384 Solution to Kaggle Mercari Competition (solo silver medal winner) ☆20 Updated 6 years ago
- Material for UW Extension Data Science 350 ☆19 Updated 7 years ago
- A couple projects using scikit-learn illustrating project decision making. ☆15 Updated 8 years ago
- Workshop: Python for Data Science ☆62 Updated 10 years ago
- Predicting sales with Pandas ☆15 Updated 9 years ago
- PyTennessee 2014: Statistical Data Analysis in Python ☆85 Updated 10 years ago
- Bayesian statistics seminars ☆30 Updated 7 years ago
- Jupyter notebook containing code from text preprocessing blog post ☆10 Updated 8 years ago
- In-class exercises for Deep Learning course at NYC Data Science Academy ☆32 Updated 6 years ago
- Simple validator for submissions to DrivenData competitions ☆19 Updated 5 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR. ☆44 Updated 8 years ago
- Code for the Kaggle acquire valued shoppers challenge ☆66 Updated 10 years ago
- Crowd Course Data Science course project ☆26 Updated 8 years ago
- Repository for code used in Kaggle competitions. ☆22 Updated 6 years ago
- Material for Data analysis and machine learning in Jupyter ☆21 Updated 7 years ago
- ☆15 Updated 6 years ago