seomoz / simhash-cpp
Simhashing in C++
☆134Updated last year
Related projects ⓘ
Alternatives and complementary repositories for simhash-cpp
- A library of inverted index data structures☆145Updated last year
- A high performance search engine☆102Updated 7 years ago
- Search Formula-1——A distributed high performance massive data engine for enterprise/vertical search☆166Updated 9 years ago
- An Implementation of Two-Trie and Tail-Trie using Double Array☆21Updated 11 years ago
- ☆80Updated 6 years ago
- Roaring Bitmap in Cython☆79Updated 5 months ago
- Simhash and near-duplicate detection☆409Updated last year
- Diskbased (persistent) hashtable☆156Updated last month
- LASSO is a parallel regression model learning system☆69Updated 10 years ago
- General purpose C++ library for iZENECloud☆42Updated 9 years ago
- HAT-Trie for Python☆86Updated 8 years ago
- Git mirror for the FastBit library Subversion repository.☆71Updated 7 years ago
- An efficient trie implementation.☆252Updated 3 years ago
- A locality-sensitive hashing library☆45Updated 10 years ago
- Parallel Suffix Array, LCP Array, and Suffix Tree Construction☆48Updated 5 years ago
- Trinity IR Infrastructure☆235Updated 5 years ago
- A-C implementation in "C". Tight-packed (interleaved) state-transition matrix -- as fast as it gets, as small as it gets.☆147Updated 3 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Updated 7 years ago
- Real time vector search engine☆139Updated last year
- google all pairs similarity search package, with swig bindings☆23Updated 9 years ago
- Code used for the experiments in the paper "Partitioned Elias-Fano Indexes"☆37Updated 9 years ago
- Parallelizing word2vec in shared and distributed memory☆191Updated last year
- Weighted MinHash implementation on CUDA (multi-gpu).☆114Updated 11 months ago
- A framework for building reranking models.☆29Updated 9 years ago
- Big Data Made Easy☆184Updated 6 years ago
- C++ implementations of indexing mechanisms, including a Hilbert-curve geohash based spatial index and a linear hashing table, for disk or…☆73Updated 3 years ago
- C library for efficient string matching with Aho-Corasick☆21Updated 12 years ago
- Succinct Data Structure Library☆105Updated 11 years ago
- A clone of Darts (Double-ARray Trie System)☆142Updated 5 years ago
- Python extension module for accelerating regular expressions using libesm☆132Updated last year