columbia-applied-data-science / rosetta
Tools, wrappers, etc. for data science, with a concentration on text processing
☆206 Updated 2 years ago
Related projects
Alternatives and complementary repositories for rosetta
- A Topic Modeling toolbox ☆93 Updated 8 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections ☆102 Updated 8 years ago
- lightweight python wrapper for vowpal wabbit ☆166 Updated 4 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility. ☆101 Updated 7 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 6 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig… ☆100 Updated 9 years ago
- Python forecasting and smoothing library ☆68 Updated 5 years ago
- The repository contains code walkthroughs that introduce Deep Learning in the field of Natural Language Processing. ☆109 Updated 8 years ago
- Kayak is a library for automatic differentiation with applications to deep neural networks. ☆227 Updated 7 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015 ☆50 Updated 9 years ago
- Finding document vectors from pre-trained word2vec word vectors ☆115 Updated 9 years ago
- My capstone project for Galvanize (Zipfian Academy) ☆38 Updated 5 years ago
- Model assisted random sampling. ☆121 Updated 4 years ago
- ☆212 Updated 8 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn. ☆69 Updated 9 years ago
- Neural Net Starter Examples ☆89 Updated 9 years ago
- Solving NLP problems with Vowpal Wabbit: Tutorial and more ☆182 Updated 8 years ago
- Material for ODSCON San Francisco 2015 ☆79 Updated 8 years ago
- A Python framework for exploring distributional semantic models. ☆85 Updated 8 years ago
- Python Environment for Bayesian Learning ☆104 Updated 13 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD] ☆109 Updated 11 years ago
- Algorithm's team Jupyter Notebooks ☆113 Updated 8 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML ☆70 Updated 5 years ago
- PyData Seattle 2015: Python Data Bikeshed ☆127 Updated 9 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle ☆36 Updated 9 years ago
- Sample repo for luigi tasks & config ☆36 Updated 8 years ago