simonemainardi / LSHash
A fast Python implementation of locality sensitive hashing.
☆70Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for LSHash
- A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.☆144Updated 2 months ago
- It is a forest of random projection trees☆224Updated 4 years ago
- A pure python implementation of locality sensitive hashing for text documents☆87Updated 9 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.☆101Updated 7 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆100Updated 9 years ago
- LSH based high dimensional clustering for sets and points☆79Updated 10 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆206Updated 2 years ago
- Python Approximate Nearest Neighbor Search in very high dimensional spaces with optimised indexing.☆215Updated 3 years ago
- A Topic Modeling toolbox☆93Updated 8 years ago
- ☆26Updated 7 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆109Updated 11 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆39Updated 9 years ago
- A Python framework for exploring distributional semantic models.☆85Updated 8 years ago
- lightweight python wrapper for vowpal wabbit☆166Updated 4 years ago
- Distributed Numpy☆148Updated 6 years ago
- Unified interface for local and distributed ndarrays☆158Updated 6 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆102Updated 9 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆102Updated 8 years ago
- Estimating how similar are two sets using MinHash (Jaccard similarity coefficient)☆30Updated 11 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons☆59Updated 3 years ago
- Machine learning evaluation database☆24Updated 6 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- ☆44Updated 9 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- Fast HyperLogLog for Python.☆99Updated 2 months ago
- Supervised learning for novelty detection in text☆79Updated 8 years ago
- Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis"☆65Updated 7 years ago
- Python forecasting and smoothing library☆68Updated 5 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago