seomoz / simhash-cpp
Simhashing in C++
☆132Updated 2 years ago
Alternatives and similar repositories for simhash-cpp:
Users that are interested in simhash-cpp are comparing it to the libraries listed below
- A high performance search engine☆105Updated 8 years ago
- Roaring Bitmap in Cython☆81Updated 11 months ago
- A library of inverted index data structures☆148Updated 2 years ago
- ☆80Updated 7 years ago
- google all pairs similarity search package, with swig bindings☆22Updated 10 years ago
- Simhash and near-duplicate detection☆414Updated last year
- Diskbased (persistent) hashtable☆161Updated 6 months ago
- HAT-Trie for Python☆86Updated 9 years ago
- A-C implementation in "C". Tight-packed (interleaved) state-transition matrix -- as fast as it gets, as small as it gets.☆148Updated 4 years ago
- A locality-sensitive hashing library☆46Updated 11 years ago
- FM-Index full-text index implementation using RRR Wavelet trees (libcds) and fast suffix sorting (libdivsufsort) including experimental r…☆108Updated 10 years ago
- Weighted MinHash implementation on CUDA (multi-gpu).☆117Updated last year
- Python API for Various DB-Backed Simhash Clusters☆64Updated 8 years ago
- Parallel Suffix Array, LCP Array, and Suffix Tree Construction☆49Updated 5 years ago
- An Implementation of Two-Trie and Tail-Trie using Double Array☆21Updated 12 years ago
- Git mirror for the FastBit library Subversion repository.☆71Updated 8 years ago
- Succinct Data Structure Library☆106Updated 11 years ago
- Real time vector search engine☆137Updated 2 years ago
- Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering☆74Updated 3 years ago
- General purpose C++ library for iZENECloud☆42Updated 10 years ago
- Search Formula-1——A distributed high performance massive data engine for enterprise/vertical search☆168Updated 9 years ago
- C++ implementation of hamming distance algorithm HmSearch using Kyoto Cabinet☆42Updated 8 years ago
- Experimental search engine in C/C++17 - still in early development.☆27Updated 6 months ago
- Trinity IR Infrastructure☆238Updated 5 years ago
- Fast differential coding functions (using SIMD instructions)☆52Updated 7 years ago
- A clone of Darts (Double-ARray Trie System)☆146Updated 6 years ago
- Parallelizing word2vec in shared and distributed memory☆190Updated 2 years ago
- Fast decoder for VByte-compressed integers☆122Updated 11 months ago
- Implementation of HNSW that supports online updates☆65Updated 7 years ago
- C++ Library implementing Compressed String Dictionaries☆46Updated 2 years ago