columbia-applied-data-science / rosettaLinks
Tools, wrappers, etc... for data science with a concentration on text processing
☆206Updated 2 years ago
Alternatives and similar repositories for rosetta
Users that are interested in rosetta are comparing it to the libraries listed below
Sorting:
- A Topic Modeling toolbox☆92Updated 9 years ago
- lightweight python wrapper for vowpal wabbit☆168Updated 5 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.☆101Updated 7 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆108Updated 12 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 10 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- ☆190Updated 2 years ago
- Model assisted random sampling.☆119Updated 4 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 8 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- Python Environment for Bayesian Learning☆104Updated 13 years ago
- A new version of phraug, which is a set of simple Python scripts for pre-processing large files☆206Updated 6 years ago
- A Python framework for exploring distributional semantic models.☆85Updated 9 years ago
- Neural Net Starter Examples☆89Updated 10 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- Understanding Probabilistic Topic Models with Simulation in Python☆64Updated 7 years ago
- Python forecasting and smoothing library☆67Updated 6 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆70Updated 5 years ago
- The repository contains code walkthroughs which introduces Deep Learning in the field of Natural Language Processing.☆108Updated 9 years ago
- Public Machine Learning and Data Competition Repo☆54Updated 9 years ago
- Solving NLP problems with Vowpal Wabbit: Tutorial and more☆183Updated 9 years ago
- Algorithm's team Jupyter Notebooks☆113Updated last month
- Tutorial and review of word2vec / doc2vec☆104Updated 10 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆37Updated 10 years ago
- ☆160Updated 8 years ago
- Natural Language Processing with Spark's MLlib☆62Updated 7 years ago
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 6 years ago