andrewclegg / sketchy
Simple approximate-nearest-neighbours in Python using locality sensitive hashing.
☆140Updated 12 years ago
Related projects ⓘ
Alternatives and complementary repositories for sketchy
- Python forecasting and smoothing library☆68Updated 5 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 2 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 10 years ago
- Demo code for learning_text_transformer☆25Updated 9 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- A Topic Modeling toolbox☆93Updated 8 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 9 years ago
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 5 years ago
- Dirichlet process mixture model (DPMM) for datamicroscopes☆12Updated 9 years ago
- mltk - Moz Language Tool Kit☆12Updated 9 years ago
- ☆33Updated 8 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis"☆65Updated 7 years ago
- A Bayesian testing framework written in Python.☆95Updated 9 years ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Updated 13 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- Machine learning evaluation database☆24Updated 6 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago
- Tool to visualize data quickly with no brain usage for plot creation☆46Updated 5 years ago
- Flask app to run a bandit algorithm testing different beer recommenders☆25Updated 10 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 8 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆102Updated 9 years ago
- Fast Vector Operations on Pretty Big Data☆13Updated 8 years ago
- Using Word2Vec on lists and sets☆34Updated 9 years ago