tdhopper / rta-pyspark-presentation
Very basic introduction to pyspark
☆15 Updated 8 years ago
Alternatives and similar repositories for rta-pyspark-presentation
Users who are interested in rta-pyspark-presentation are comparing it to the repositories listed below.
- Slides and materials for most of my talks by year ☆92 Updated last year
- Material for UW Extension Data Science 350 ☆19 Updated 7 years ago
- Material and slides for Boston NLP meetup May 23rd 2016 ☆17 Updated 9 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit ☆24 Updated 11 years ago
- ☆15 Updated 7 years ago
- Jupyter notebook containing code from text preprocessing blog post ☆10 Updated 8 years ago
- feng - feature engineering for machine-learning champions ☆27 Updated 8 years ago
- Notes for Data Science 350 Class ☆24 Updated 8 years ago
- Predicting happiness from demographics and poll answers ☆45 Updated 8 years ago
- Code for the Kaggle acquire valued shoppers challenge ☆66 Updated 11 years ago
- Springboard - Data Science Intensive course ☆13 Updated 8 years ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition ☆55 Updated 10 years ago
- Machine Learning Versioning made Simple ☆38 Updated 3 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator ☆30 Updated 7 years ago
- Introduction to structured prediction with Python and pystruct ☆18 Updated 7 years ago
- Miscellaneous Jupyter Notebooks for my course. ☆24 Updated 8 years ago
- Predicting sales with Pandas ☆15 Updated 9 years ago
- Course of Machine Learning in Science and Industry at Heidelberg university ☆47 Updated 8 years ago
- Crowd Course Data Science course project ☆27 Updated 9 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 7 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course. ☆23 Updated 10 years ago
- PyTennessee 2014: Statistical Data Analysis in Python ☆85 Updated 11 years ago
- ☆57 Updated 8 years ago
- Programs with word vectors, RNN, NLP stuff, etc ☆18 Updated 8 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes. ☆16 Updated 8 years ago
- ☆25 Updated 9 years ago
- A helper library for data science pipeline ☆36 Updated 6 years ago
- Source code for the "Practical Data Science in Python" tutorial ☆58 Updated 10 years ago
- Bayesian statistics seminars ☆30 Updated 8 years ago
- ☆66 Updated 2 years ago