columbia-applied-data-science / rosetta
Tools, wrappers, etc... for data science with a concentration on text processing
☆206Updated 2 years ago
Alternatives and similar repositories for rosetta:
Users that are interested in rosetta are comparing it to the libraries listed below
- A Topic Modeling toolbox☆92Updated 8 years ago
- The repository contains code walkthroughs which introduces Deep Learning in the field of Natural Language Processing.☆109Updated 8 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 9 years ago
- lightweight python wrapper for vowpal wabbit☆166Updated 5 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.☆101Updated 7 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 6 years ago
- Solving NLP problems with Vowpal Wabbit: Tutorial and more☆182Updated 8 years ago
- A Python framework for exploring distributional semantic models.☆85Updated 9 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 9 years ago
- Finding document vectors from pre-trained word2vec word vectors☆115Updated 9 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆108Updated 11 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- ☆190Updated last year
- Neural Net Starter Examples☆89Updated 9 years ago
- PyData Seattle 2015: Python Data Bikeshed☆127Updated 9 years ago
- Tutorial and review of word2vec / doc2vec☆104Updated 9 years ago
- Understanding Probabilistic Topic Models with Simulation in Python☆64Updated 7 years ago
- Introduction to Deep Learning☆127Updated 9 years ago
- online natural language processing with word vectors☆310Updated 7 months ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆70Updated 5 years ago
- Transition-based statistical parser☆417Updated 7 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- Material for ODSCON San Francisco 2015☆79Updated 8 years ago
- Pydata NYC 2014 Scikit Learn Tutorial☆64Updated 10 years ago
- A new version of phraug, which is a set of simple Python scripts for pre-processing large files☆206Updated 6 years ago
- Repository containing files for my PyCon 2014 scikit-learn tutorial.☆226Updated 8 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆42Updated 8 years ago
- Code for Learning with Data Blog☆64Updated 7 years ago
- Kayak is a library for automatic differentiation with applications to deep neural networks.☆227Updated 7 years ago
- Rapid Machine Learning Prototyping in Python☆651Updated 9 years ago