VincentShenbw / similarityjoin
Implementation of many similarity join algorithms.
☆15Updated 11 years ago
Alternatives and similar repositories for similarityjoin
Users who are interested in similarityjoin compare it to the libraries listed below.
- Implementation of Bayesian Sets for fast similarity searches.☆14Updated 13 years ago
- A C++ library implementing fast language model estimation using the 1-Sort algorithm.☆17Updated 2 years ago
- Implements dictionary-based entity extraction as described in the FAERIE paper http://dbgroup.cs.tsinghua.edu.cn/dd/papers/sigmod2011-fae…☆9Updated 8 years ago
- Official repository of Quickscorer: a fast algorithm to rank documents with additive ensembles of regression trees.☆18Updated 8 years ago
- LSH index for approximate set containment search☆57Updated 2 years ago
- Implementation of QuadSketch algorithm☆11Updated 2 years ago
- A C++ library to benchmark inverted indexes.☆20Updated 4 years ago
- Gopalan, P., Ruiz, F. J., Ranganath, R., & Blei, D. M. (2014). Bayesian Nonparametric Poisson Factorization for Recommendation Systems. I…☆15Updated 10 years ago
- A Space-Optimal Grammar Compression☆10Updated 4 years ago
- SRS - Fast Approximate Nearest Neighbor Search in High Dimensional Euclidean Space With a Tiny Index☆55Updated 9 years ago
- Semantic embeddings of entities☆66Updated 8 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness.☆11Updated 2 years ago
- TuffyLite is an open-source MLN inference engine that modifies the original Tuffy solver.☆27Updated 8 years ago
- Example project presented at the Succinct Data Structure Tutorial at SIGIR 2016☆25Updated 6 years ago
- Parameterless and Universal FInding of Nearest Neighbors☆60Updated 2 months ago
- Google all-pairs similarity search package, with SWIG bindings☆22Updated 10 years ago
- A C++ library providing fast language model queries in compressed space.☆129Updated 2 years ago
- High-dimensional approximate nearest neighbor in python☆11Updated 6 years ago
- To Index or Not to Index: Optimizing Exact Maximum Inner Product Search☆26Updated 6 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago
- Code repo for the SIGIR '17 paper "Efficient Cost-Aware Cascade Ranking for Multi-Stage Retrieval"☆10Updated 2 years ago
- Modify word2vec such that it's possible to "condition" on existing embeddings for some words, and induce embeddings for new words.☆39Updated 9 years ago
- Implementation of fast exact k-means algorithms☆45Updated 5 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing☆23Updated last year
- ☆16Updated 8 years ago
- TREC Real-Time Summarization Tools☆15Updated 7 years ago
- A flexible variational inference LDA library.☆22Updated 6 years ago
- Jubatus algorithm component☆20Updated 6 years ago
- Scalable inference for Correlated Topic Models☆30Updated 10 years ago