go2starr / lshhdcLinks
LSH based high dimensional clustering for sets and points
☆80Updated 11 years ago
Alternatives and similar repositories for lshhdc
Users that are interested in lshhdc are comparing it to the libraries listed below
Sorting:
- A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.☆148Updated last year
- Various gfx for a presentation at NYC ML meetup☆62Updated 10 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.☆101Updated 8 years ago
- A pure python implementation of locality sensitive hashing for text documents☆85Updated 10 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆98Updated 10 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"☆110Updated 11 years ago
- Dynamic Topic Model (based upon code released by David Blei at http://www.cs.princeton.edu/~blei/topicmodeling.html)☆31Updated 7 years ago
- It is a forest of random projection trees☆224Updated 5 years ago
- A Python framework for exploring distributional semantic models.☆85Updated 9 years ago
- Data Clustering in Python☆44Updated 8 years ago
- Online Max-Margin Topic Models for Accurate and Fast Text Classification [release v0.1]☆53Updated 9 years ago
- Collaborative modeling for recommendation. Implements variational inference for a collaborative topic models. These models recommend item…☆147Updated 10 years ago
- Entity level sentiment analysis for product reviews using deep learning☆56Updated 9 years ago
- Classifying text with bag-of-words☆113Updated 10 years ago
- Topic modelling software using non-parametric methods☆70Updated 9 years ago
- lightweight python wrapper for vowpal wabbit☆168Updated 5 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- A new version of phraug, which is a set of simple Python scripts for pre-processing large files☆207Updated 7 years ago
- Additive Groves, Bagged Trees with Feature Evaluation, Interaction Detection, Visualization of Feature Effects.☆66Updated 4 years ago
- Example Python code for comparing documents using MinHash☆251Updated 6 years ago
- Finding document vectors from pre-trained word2vec word vectors☆116Updated 10 years ago
- A high performance implementation of HDBSCAN clustering. http://hdbscan.readthedocs.io/en/latest/☆100Updated 8 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆207Updated 3 years ago
- Word vectors☆64Updated 7 years ago
- bindings for the sofia-ml machine learning library☆37Updated 2 years ago
- Remove Tomek Links from your data.☆30Updated 8 years ago
- Knowledge extraction from web data☆92Updated 7 years ago
- An implementation of Caruana et al's Ensemble Selection algorithm in Python, based on scikit-learn☆150Updated 4 years ago
- A fast Python implementation of locality sensitive hashing.☆70Updated 10 years ago
- Locality-sensitive hashing in PySpark.☆27Updated 10 years ago