jmankoff / data
The repository for the CMU Data Pipeline course. This year's course should use branch 2017
☆40Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for data
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- Jupyter notebook containing code from text preprocessing blog post☆10Updated 7 years ago
- Repository for data science course Spring 14☆181Updated 10 years ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 10 years ago
- General Assembly repo for Data Science 18☆36Updated 9 years ago
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara☆17Updated 8 years ago
- tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark☆79Updated 6 years ago
- My talk at Strata 2014 in Santa Clara, CA☆73Updated 10 years ago
- Repository for my 'K-Means Clustering with Scikit-Learn' talk materials.☆43Updated 5 years ago
- Problem Sets for Jour72326: Scraping for Journalists.☆20Updated 7 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- Jupyter Notebook tips and tricks for the Berkeley Institute for Data Science lecture. http://bids.berkeley.edu/☆28Updated 8 years ago
- Code for the Kaggle acquire valued shoppers challenge☆66Updated 10 years ago
- Repo for Working with Open Data (Spring 2014 edition), a course at the School of Information, UC Berkeley☆34Updated 8 years ago
- Data directory for the CS109 Data Science course☆66Updated 10 years ago
- ☆26Updated 10 months ago
- Some IPython notebooks I've created...☆29Updated 8 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Updated 6 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- PySpark Notebook and Shiny App for Demo☆35Updated 7 years ago
- Pydata NYC 2014 Scikit Learn Tutorial☆64Updated 9 years ago
- PyData Madrid 2016 material for the talk: A Primer to recommendation Systems☆37Updated 8 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Updated 7 years ago
- Scikit-learn quickstart tutorial for Webstep☆18Updated 7 years ago
- Code and Notebooks for the Natural Language Processing with Python course.☆66Updated 6 years ago
- A Shiny App for Telecom Customers churn prediction☆12Updated 10 years ago