andrewclegg / sketchy
Simple approximate-nearest-neighbours in Python using locality sensitive hashing.
☆140Updated 12 years ago
Alternatives and similar repositories for sketchy
Users that are interested in sketchy are comparing it to the libraries listed below
Sorting:
- Python forecasting and smoothing library☆67Updated 6 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆100Updated 10 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- mltk - Moz Language Tool Kit☆12Updated 10 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- Fast Dot Products on Pretty Big Data☆15Updated 6 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- A set of distinct value estimators that give probabilistic bounds on a sets cardinality☆22Updated 5 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Data science tools from Moz☆22Updated 8 years ago
- A Bayesian testing framework written in Python.☆94Updated 10 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 9 years ago
- Using Word2Vec on lists and sets☆34Updated 9 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis"☆64Updated 8 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 10 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Updated 11 years ago
- Fast Vector Operations on Pretty Big Data☆13Updated 9 years ago
- Topic Model or LDA in Cython☆21Updated 14 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 9 years ago
- This repository is not maintained anymore. ConfusionMatrix is now part of pandas-ml☆19Updated 8 years ago
- Python implementation of Markov Networks for neural computing.☆38Updated 2 months ago
- Common post-estimation tasks for scikit-learn☆17Updated 8 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated last year
- Fast, easy and intuitive machine learning prototyping.☆124Updated 10 years ago
- Preprocess text for NLP (tokenizing, lowercasing, stemming, sentence splitting, etc.)☆29Updated 13 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago