seomoz / simhash-cppLinks
Simhashing in C++
☆135Updated 2 years ago
Alternatives and similar repositories for simhash-cpp
Users that are interested in simhash-cpp are comparing it to the libraries listed below
Sorting:
- A library of inverted index data structures☆150Updated 2 years ago
- A high performance search engine☆107Updated 8 years ago
- google all pairs similarity search package, with swig bindings☆22Updated 10 years ago
- Search Formula-1——A distributed high performance massive data engine for enterprise/vertical search☆168Updated 10 years ago
- C library for efficient string matching with Aho-Corasick☆21Updated 13 years ago
- Real time vector search engine☆138Updated 2 years ago
- A framework for building reranking models.☆28Updated 10 years ago
- An efficient trie implementation.☆255Updated 4 years ago
- Weighted MinHash implementation on CUDA (multi-gpu).☆119Updated last year
- A C++ library providing fast language model queries in compressed space.☆132Updated 2 years ago
- Trinity IR Infrastructure☆238Updated 5 years ago
- A-C implementation in "C". Tight-packed (interleaved) state-transition matrix -- as fast as it gets, as small as it gets.☆149Updated 4 years ago
- LASSO is a parallel regression model learning system☆69Updated 11 years ago
- A clone of Darts (Double-ARray Trie System)☆152Updated 3 months ago
- Python API for Various DB-Backed Simhash Clusters☆64Updated 8 years ago
- similarity join and search algorithms for edit distance and jaccard☆18Updated 7 years ago
- Fast decoder for VByte-compressed integers☆124Updated last year
- Succinct Data Structure Library☆107Updated 11 years ago
- Code used for the experiments in the paper "Partitioned Elias-Fano Indexes"☆40Updated 10 years ago
- Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering☆77Updated 3 years ago
- Diskbased (persistent) hashtable☆164Updated 11 months ago
- General purpose C++ library for iZENECloud☆42Updated 10 years ago
- Parallel Suffix Array, LCP Array, and Suffix Tree Construction☆49Updated 6 years ago
- Python extension module for accelerating regular expressions using libesm☆132Updated last year
- Fast, efficiently stored Trie for Python. Uses libdatrie.☆537Updated last year
- ☆81Updated 7 years ago
- Logistic regression engine for medium-sized data☆55Updated 10 years ago
- An Implementation of Two-Trie and Tail-Trie using Double Array☆21Updated 12 years ago
- Rolling Hash C++ Library☆187Updated last year
- Git mirror for the FastBit library Subversion repository.☆71Updated 8 years ago