GalvanizeDataScience / pipelines_and_featureunions
An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.
☆16Updated 7 years ago
Alternatives and similar repositories for pipelines_and_featureunions:
Users that are interested in pipelines_and_featureunions are comparing it to the libraries listed below
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems☆20Updated 6 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 8 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 6 years ago
- Run Nx2 Cross Validation for multiple binary classifiers in parallel with optional downsampling☆13Updated 10 years ago
- ☆26Updated 9 years ago
- Code for PyData Talk on "Classifying Products Based on Images and Text using Keras"☆30Updated 7 years ago
- the 2nd place solution for West Nile Virus Prediction challenge on Kaggle☆36Updated 9 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 10 years ago
- ☆13Updated 7 years ago
- ☆24Updated 8 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- Fast, accurate, lightweight, multi-core ML in Python, leveraging Vowpal Wabbit☆21Updated 6 years ago
- A tool that evolves small brains capable of scanning and classifying an image.☆13Updated 8 years ago
- NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-lear…☆30Updated 6 years ago
- Slides for my doc2vec workshop/talk☆29Updated 7 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 8 years ago
- My best submission to the Kaggle competition "Online Product Sales", ranked 21th over 366 teams.☆29Updated 12 years ago
- Healthcare Twitter Analysis☆26Updated 8 years ago
- Reinforcement Learning Algorithms☆14Updated 6 years ago
- Docker container with a PyData stack and JupyterHub server☆37Updated 8 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 9 years ago
- Predicting closed questions on Stack Overflow☆45Updated 7 years ago
- Understanding Probabilistic Topic Models with Simulation in Python☆64Updated 7 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- Pydata Seattle 2015 Trend Estimation in Time Series Signals Deck + Notebooks☆21Updated 9 years ago
- Predicting happiness from demographics and poll answers☆45Updated 8 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 9 years ago
- My Tutorial for PyData London☆25Updated 9 years ago