phoenix24 / google-all-pairs-similarity-searchLinks
google all pairs similarity search package, with swig bindings
☆23Updated 10 years ago
Alternatives and similar repositories for google-all-pairs-similarity-search
Users that are interested in google-all-pairs-similarity-search are comparing it to the libraries listed below
Sorting:
- Python API for Various DB-Backed Simhash Clusters☆64Updated 8 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆29Updated 9 years ago
- Hacky implementation of ppjoin by Chuan Xia et Al☆19Updated 11 years ago
- ☆27Updated 10 years ago
- LASER-A Scalable Response Prediction Platform For Online Advertising☆48Updated 11 years ago
- Simhashing in C++☆136Updated 2 years ago
- This toolkit consists of implementations of various graph-based semi-supervised learning (SSL) algorithms. Currently, three algorithms ar…☆154Updated 8 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation☆98Updated 14 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Updated 10 years ago
- Automatically exported from code.google.com/p/jforests☆67Updated 5 years ago
- ☆37Updated 7 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 7 years ago
- Semantic embeddings of entities☆66Updated 9 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 13 years ago
- Locality Sensitive Hashing for Apache Spark☆87Updated 3 years ago
- ☆70Updated 10 years ago
- ☆52Updated 8 years ago
- PLDA: Parallel Latent Dirichlet Allocation in C++☆83Updated 2 years ago
- This tool extracts word vectors from Lucene index.☆135Updated 8 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- C++ implement of Tomas Mikolov's word/document embedding☆106Updated 8 years ago
- Deep Learning for NLP resources☆17Updated 10 years ago
- ☆26Updated 8 years ago
- LASSO is a parallel regression model learning system☆69Updated 12 years ago
- Nonparametric timeseries classification for Twitter trending topic detection (MEng thesis)☆119Updated 12 years ago
- Repository for post higgs-competition model submission☆26Updated 11 years ago
- Locality-sensitive hashing in PySpark.☆27Updated 10 years ago
- From Natural Language Text to Graph Database☆31Updated 9 years ago
- A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.☆148Updated last year
- Python Approximate Nearest Neighbor Search in very high dimensional spaces with optimised indexing.☆216Updated 4 years ago