seomoz / simhash-cppLinks
Simhashing in C++
☆133Updated 2 years ago
Alternatives and similar repositories for simhash-cpp
Users that are interested in simhash-cpp are comparing it to the libraries listed below
Sorting:
- A library of inverted index data structures☆150Updated 2 years ago
- A high performance search engine☆106Updated 8 years ago
- A C++ library providing fast language model queries in compressed space.☆130Updated 2 years ago
- Git mirror for the FastBit library Subversion repository.☆71Updated 8 years ago
- A-C implementation in "C". Tight-packed (interleaved) state-transition matrix -- as fast as it gets, as small as it gets.☆149Updated 4 years ago
- A locality-sensitive hashing library☆46Updated 11 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Updated 8 years ago
- Fast differential coding functions (using SIMD instructions)☆54Updated 7 years ago
- google all pairs similarity search package, with swig bindings☆22Updated 10 years ago
- General purpose C++ library for iZENECloud☆42Updated 10 years ago
- Simhash and near-duplicate detection☆416Updated 2 years ago
- Code used for the experiments in the paper "Partitioned Elias-Fano Indexes"☆40Updated 10 years ago
- C++ implementations of indexing mechanisms, including a Hilbert-curve geohash based spatial index and a linear hashing table, for disk or…☆77Updated 4 years ago
- A General-Purpose Counting Filter: Counting Quotient Filter☆127Updated last year
- Rolling Hash C++ Library☆187Updated last year
- Search Formula-1——A distributed high performance massive data engine for enterprise/vertical search☆168Updated 10 years ago
- An efficient trie implementation.☆255Updated 4 years ago
- An Implementation of Two-Trie and Tail-Trie using Double Array☆21Updated 12 years ago
- Example project presented at the Succinct Data Structure Tutorial at SIGIR 2016☆25Updated 6 years ago
- Weighted MinHash implementation on CUDA (multi-gpu).☆117Updated last year
- Language Detection based on Chromium's Compact Language Detector library☆105Updated 4 years ago
- A clone of Darts (Double-ARray Trie System)☆149Updated last month
- Fast decoder for VByte-compressed integers☆123Updated last year
- A framework for building reranking models.☆28Updated 10 years ago
- Count-Min sketch-based approximate counting library☆45Updated last month
- An efficient external-memory algorithm for the construction of minimal perfect hash functions☆64Updated last year
- Succinct Data Structure Library☆106Updated 11 years ago
- LASSO is a parallel regression model learning system☆69Updated 11 years ago
- Diskbased (persistent) hashtable☆161Updated 8 months ago
- ☆81Updated 7 years ago