seomoz / simhash-cppLinks
Simhashing in C++
☆134Updated 2 years ago
Alternatives and similar repositories for simhash-cpp
Users that are interested in simhash-cpp are comparing it to the libraries listed below
Sorting:
- A library of inverted index data structures☆150Updated 2 years ago
- A high performance search engine☆107Updated 8 years ago
- google all pairs similarity search package, with swig bindings☆22Updated 10 years ago
- Roaring Bitmap in Cython☆81Updated last year
- Search Formula-1——A distributed high performance massive data engine for enterprise/vertical search☆168Updated 10 years ago
- A framework for building reranking models.☆28Updated 10 years ago
- Trinity IR Infrastructure☆238Updated 5 years ago
- A-C implementation in "C". Tight-packed (interleaved) state-transition matrix -- as fast as it gets, as small as it gets.☆149Updated 4 years ago
- Weighted MinHash implementation on CUDA (multi-gpu).☆117Updated last year
- Succinct Data Structure Library☆106Updated 11 years ago
- A locality-sensitive hashing library☆46Updated 11 years ago
- ☆81Updated 7 years ago
- General purpose C++ library for iZENECloud☆42Updated 10 years ago
- An efficient trie implementation.☆255Updated 4 years ago
- FM-Index full-text index implementation using RRR Wavelet trees (libcds) and fast suffix sorting (libdivsufsort) including experimental r…☆107Updated 10 years ago
- Efficient and effective query auto-completion in C++.☆55Updated last year
- Parallel Suffix Array, LCP Array, and Suffix Tree Construction☆49Updated 5 years ago
- similarity join and search algorithms for edit distance and jaccard☆18Updated 7 years ago
- HAT-Trie for Python☆86Updated 9 years ago
- Experimental search engine in C/C++17 - still in early development.☆27Updated 9 months ago
- A Hybrid Parallel Implementation of the Aho-Corasick and Wu-Manber Algorithms Using NVIDIA CUDA and MPI Evaluated on a Biological Sequenc…☆19Updated 8 years ago
- Parameterless and Universal FInding of Nearest Neighbors☆60Updated 4 months ago
- C library for efficient string matching with Aho-Corasick☆21Updated 13 years ago
- Code used for the experiments in the paper "Partitioned Elias-Fano Indexes"☆40Updated 10 years ago
- LASSO is a parallel regression model learning system☆69Updated 11 years ago
- Git mirror for the FastBit library Subversion repository.☆71Updated 8 years ago
- Big Data Made Easy☆185Updated 7 years ago
- SRS - Fast Approximate Nearest Neighbor Search in High Dimensional Euclidean Space With a Tiny Index☆55Updated 10 years ago
- Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering☆75Updated 3 years ago
- A clone of Darts (Double-ARray Trie System)☆149Updated 2 months ago