tdhopper / rta-pyspark-presentation
Very basic introduction to pyspark
☆15Updated 8 years ago
Alternatives and similar repositories for rta-pyspark-presentation:
Users that are interested in rta-pyspark-presentation are comparing it to the libraries listed below
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- Notes for Data Science 350 Class☆24Updated 8 years ago
- Jupyter notebook containing code from text preprocessing blog post☆10Updated 8 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- Springboard - Data Science Intensive course☆13Updated 8 years ago
- Crowd Course Data Science course project☆27Updated 8 years ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 10 years ago
- Slides and materials for most of my talks by year☆92Updated last year
- This library is a wrapper for sklearn and works with data stored using Pandas module.☆17Updated 9 years ago
- Introduction to structured prediction with Python and pystruct☆18Updated 6 years ago
- Code for the Kaggle acquire valued shoppers challenge☆66Updated 10 years ago
- In-class exercises for Deep Learning course at NYC Data Science Academy☆32Updated 7 years ago
- Codes related to Knocktober 2016☆23Updated 8 years ago
- Companion code for my video course on Practical Python Data Science Techniques, published by Packt Publishing☆33Updated 7 years ago
- Bayesian statistics seminars☆30Updated 7 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 6 years ago
- ☆19Updated 4 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- Contains code for understanding TensorFlow workflow and basics☆51Updated 7 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- allennlp tutorial for O'Reilly AI Conference, September 2019☆22Updated 5 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- Experimental library for sampling and validating scikit-learn parameters☆10Updated 5 years ago
- Simple validator for submissions to DrivenData competitions☆19Updated 5 years ago
- Repository for my 'K-Means Clustering with Scikit-Learn' talk materials.☆43Updated 6 years ago
- Understanding Seattle Bike Count data☆17Updated 7 years ago
- Material for UW Extension Data Science 350☆19Updated 7 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Jupyter Notebook tips and tricks for the Berkeley Institute for Data Science lecture. http://bids.berkeley.edu/☆28Updated 9 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago