VincentShenbw / similarityjoinLinks
Implementation of many similarity join algorithms.
☆15Updated 11 years ago
Alternatives and similar repositories for similarityjoin
Users that are interested in similarityjoin are comparing it to the libraries listed below
Sorting:
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness.☆11Updated 2 years ago
- Implementation of QuadSketch algorithm☆11Updated 2 years ago
- SRS - Fast Approximate Nearest Neighbor Search in High Dimensional Euclidean Space With a Tiny Index☆55Updated 10 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17Updated 2 years ago
- To Index or Not to Index: Optimizing Exact Maximum Inner Product Search☆26Updated 6 years ago
- Example project presented at the Succinct Data Structure Tutorial at SIGIR 2016☆25Updated 6 years ago
- Implementation of Bayesian Sets for fast similarity searches.☆14Updated 13 years ago
- A flexible variational inference LDA library.☆22Updated 6 years ago
- A C++ library to benchmark inverted indexes.☆20Updated 4 years ago
- Official repository of Quickscorer: a fast algorithm to rank documents with additive ensembles of regression trees.☆18Updated 8 years ago
- Optimal partitioning of Variable-Byte codes for better compression and fast decoding.☆17Updated 3 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing☆23Updated last year
- Parallel Suffix Array, LCP Array, and Suffix Tree Construction☆49Updated 5 years ago
- A Space-Optimal Grammar Compression☆10Updated 4 years ago
- A C++ template library for compact Hamming distance indexes☆10Updated 8 years ago
- Simple implementation of CoveringLSH☆10Updated 9 years ago
- ☆10Updated 9 years ago
- LSH index for approximate set containment search☆57Updated 2 years ago
- similarity join and search algorithms for edit distance and jaccard☆18Updated 7 years ago
- Efficient and effective query auto-completion in C++.☆54Updated last year
- Suite of universal indexes for Highly Repetitive Document Collections☆20Updated 5 years ago
- High-dimensional approximate nearest neighbor in python☆11Updated 6 years ago
- Python bindings to Succinct Data Structure Library 2.0☆31Updated 6 years ago
- Fast implementations of the scancount algorithm: C++ header-only library☆26Updated 5 years ago
- A C++ library for summarizing data streams☆24Updated 5 years ago
- TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation☆14Updated 2 years ago
- Clustering documents based on LSH☆14Updated 9 years ago
- google all pairs similarity search package, with swig bindings☆22Updated 10 years ago
- Yet another regression toolkit☆12Updated 11 years ago
- ☆26Updated 8 years ago