andrewclegg / sketchy
Simple approximate-nearest-neighbours in Python using locality sensitive hashing.
☆140Updated 12 years ago
Alternatives and similar repositories for sketchy:
Users that are interested in sketchy are comparing it to the libraries listed below
- Python forecasting and smoothing library☆67Updated 5 years ago
- A Topic Modeling toolbox☆92Updated 8 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 9 years ago
- Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis"☆65Updated 7 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 6 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- lightweight python wrapper for vowpal wabbit☆166Updated 5 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.☆101Updated 7 years ago
- A set of distinct value estimators that give probabilistic bounds on a sets cardinality☆22Updated 5 years ago
- locality sensitive hashing☆69Updated 12 years ago
- Concept discovery and recommendation library built on top of the IBM Watson cognitive API.☆24Updated 8 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- Data science tools from Moz☆22Updated 8 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 4 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆33Updated 9 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated 10 months ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- feng - feature engineering for machine-learning champions☆27Updated 7 years ago
- SmallK: very fast data clustering tools☆14Updated 5 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 8 years ago
- k-means + a linear model = good results☆55Updated 10 years ago
- A fast Python implementation of locality sensitive hashing.☆70Updated 9 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆40Updated 9 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- Collection of dask example notebooks☆57Updated 6 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 9 years ago