seomoz / simhash-cpp
Simhashing in C++
☆133 · Updated 2 years ago
Alternatives and similar repositories for simhash-cpp:
Users who are interested in simhash-cpp are comparing it to the libraries listed below.
- Python API for various DB-backed simhash clusters ☆64 · Updated 7 years ago
- A high-performance search engine ☆104 · Updated 8 years ago
- A library of inverted index data structures ☆148 · Updated 2 years ago
- Roaring Bitmap in Cython ☆80 · Updated 8 months ago
- Aho-Corasick (A-C) implementation in C. Tight-packed (interleaved) state-transition matrix: as fast as it gets, as small as it gets. ☆146 · Updated 4 years ago
- HAT-Trie for Python ☆86 · Updated 9 years ago
- C library for efficient string matching with Aho-Corasick ☆21 · Updated 13 years ago
- An implementation of Two-Trie and Tail-Trie using Double Array ☆21 · Updated 12 years ago
- Weighted MinHash implementation on CUDA (multi-GPU). ☆117 · Updated last year
- General-purpose C++ library for iZENECloud ☆42 · Updated 9 years ago
- Python extension module for accelerating regular expressions using libesm ☆132 · Updated last year
- Search Formula-1: a distributed, high-performance massive data engine for enterprise/vertical search ☆167 · Updated 9 years ago
- A clone of Darts (Double-ARray Trie System) ☆144 · Updated 6 years ago
- Simhash and near-duplicate detection ☆412 · Updated last year
- Trinity IR Infrastructure ☆237 · Updated 5 years ago
- A locality-sensitive hashing library ☆46 · Updated 10 years ago
- Git mirror for the FastBit library Subversion repository. ☆71 · Updated 8 years ago
- A General-Purpose Counting Filter: Counting Quotient Filter ☆127 · Updated last year
- Rolling Hash C++ Library ☆188 · Updated 11 months ago
- An efficient trie implementation. ☆253 · Updated 4 years ago
- Experimental search engine in C/C++17, still in early development. ☆27 · Updated 4 months ago
- Fast decoder for VByte-compressed integers ☆121 · Updated 8 months ago
- Fast differential coding functions (using SIMD instructions) ☆52 · Updated 7 years ago
- BagMinHash: Minwise Hashing Algorithm for Weighted Sets ☆26 · Updated 4 years ago
- Google all-pairs similarity search package, with SWIG bindings ☆22 · Updated 9 years ago
- LASSO is a parallel regression model learning system ☆69 · Updated 11 years ago
- Real-time vector search engine ☆138 · Updated last year
- Parallel Suffix Array, LCP Array, and Suffix Tree Construction ☆48 · Updated 5 years ago
- A fast implementation of varbyte 32-bit/64-bit integer compression ☆116 · Updated 7 years ago
- C++ implementation of Tomas Mikolov's word/document embedding ☆104 · Updated 7 years ago