usc-isi-i2 / dig-lsh-clusteringLinks
Clustering documents based on LSH
☆14Updated 9 years ago
Alternatives and similar repositories for dig-lsh-clustering
Users that are interested in dig-lsh-clustering are comparing it to the libraries listed below
Sorting:
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 10 years ago
- A pure python implementation of locality sensitive hashing for text documents☆85Updated 9 years ago
- Auto Encoder on Tensorflow☆12Updated 7 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- A startup search engine made using embeddings built on crunchbase company descriptions☆11Updated 9 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 10 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation☆98Updated 13 years ago
- ☆26Updated 8 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- A board game recommendation engine/model/website.☆40Updated 8 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆22Updated 7 years ago
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 6 years ago
- Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)☆36Updated 2 years ago
- Nonparametric timeseries classification for Twitter trending topic detection (MEng thesis)☆119Updated 11 years ago
- Semantic embeddings of entities☆66Updated 8 years ago
- Using Word2Vec on lists and sets☆34Updated 2 months ago
- framework for doing NER and other types of entity recognition, in Python☆68Updated 3 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆49Updated 12 years ago
- Reimplementation of deepwalk algorithm from https://github.com/phanein/deepwalk☆38Updated 9 years ago
- The useful and used parts of NN-Dropout☆25Updated 10 years ago
- An autoencoder to calculate word embeddings as mentioned in Lebret/Collobert paper 2015☆74Updated 8 years ago
- TREC Real-Time Summarization Tools☆15Updated 8 years ago
- locality sensitive hashing☆71Updated 13 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Updated 8 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- ☆96Updated 7 years ago
- Notes on Lambda Architecture☆12Updated 7 years ago
- In this project, we use skip-gram model to embed Wikipedia Concepts and Entities. The English version of Wikipedia contains more than fiv…☆57Updated 7 years ago
- A convolutional neural network library for NLP.☆59Updated 7 years ago