columbia-applied-data-science / rosetta
Tools, wrappers, etc... for data science with a concentration on text processing
☆206Updated 2 years ago
Alternatives and similar repositories for rosetta
Users that are interested in rosetta are comparing it to the libraries listed below
- A Topic Modeling toolbox☆92Updated 9 years ago
- lightweight python wrapper for vowpal wabbit☆169Updated 5 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.☆101Updated 7 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆71Updated 5 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 10 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆108Updated 12 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- Neural Net Starter Examples☆89Updated 10 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 8 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Tutorial and review of word2vec / doc2vec☆104Updated 10 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015☆50Updated 10 years ago
- Public Machine Learning and Data Competition Repo☆54Updated 9 years ago
- Latent Dirichlet allocation (LDA) for datamicroscopes☆41Updated 9 years ago
- The repository contains code walkthroughs that introduce Deep Learning in the field of Natural Language Processing.☆108Updated 9 years ago
- ☆190Updated 2 years ago
- A new version of phraug, which is a set of simple Python scripts for pre-processing large files☆207Updated 7 years ago
- Python forecasting and smoothing library☆67Updated 6 years ago
- Model assisted random sampling.☆119Updated 5 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- Understanding Probabilistic Topic Models with Simulation in Python☆64Updated 7 years ago
- Experimental parallel data analysis toolkit.☆121Updated 3 years ago
- Snape is a convenient artificial dataset generator that wraps sklearn's make_classification and make_regression and then adds in 'realism…☆167Updated 5 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Updated 6 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- Solving NLP problems with Vowpal Wabbit: Tutorial and more☆183Updated 9 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Code for Learning with Data Blog☆64Updated 8 years ago
- SigOpt wrappers for scikit-learn methods☆75Updated 2 years ago