columbia-applied-data-science / rosetta
Tools, wrappers, etc. for data science, with a concentration on text processing
☆207 Updated 2 years ago
Alternatives and similar repositories for rosetta
Users that are interested in rosetta are comparing it to the libraries listed below
- A Topic Modeling toolbox ☆92 Updated 9 years ago
- lightweight python wrapper for vowpal wabbit ☆169 Updated 5 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig… ☆99 Updated 10 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility. ☆101 Updated 7 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections ☆101 Updated 9 years ago
- Neural Net Starter Examples ☆89 Updated 10 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 7 years ago
- Python forecasting and smoothing library ☆67 Updated 6 years ago
- Tutorial and review of word2vec / doc2vec ☆104 Updated 10 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML ☆71 Updated 5 years ago
- Different types of tutorials, such as machine learning, image processing, etc. ☆102 Updated 9 years ago
- ☆190 Updated 2 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015 ☆50 Updated 10 years ago
- SigOpt wrappers for scikit-learn methods ☆75 Updated 2 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD] ☆108 Updated 12 years ago
- Understanding Probabilistic Topic Models with Simulation in Python ☆64 Updated 7 years ago
- This repository contains code walkthroughs that introduce Deep Learning in the field of Natural Language Processing. ☆109 Updated 9 years ago
- Multidimensional data explorer and visualization tool. ☆56 Updated 8 years ago
- Creates models to classify documents into categories ☆66 Updated 7 years ago
- A new version of phraug, which is a set of simple Python scripts for pre-processing large files ☆207 Updated 7 years ago
- Algorithm's team Jupyter Notebooks ☆113 Updated 3 months ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn. ☆69 Updated 10 years ago
- ☆160 Updated 8 years ago
- Snape is a convenient artificial dataset generator that wraps sklearn's make_classification and make_regression and then adds in 'realism… ☆167 Updated 5 years ago
- Experimental parallel data analysis toolkit. ☆121 Updated 3 years ago
- A Python framework for exploring distributional semantic models. ☆85 Updated 9 years ago
- Code for Learning with Data Blog ☆64 Updated 8 years ago
- Model assisted random sampling. ☆119 Updated 5 years ago
- Uncertainty quantification book chapter ☆49 Updated 10 years ago
- Slides and notebooks for PyData Strata San Jose ☆50 Updated 10 years ago