oertl / treeminhash
TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation
☆14Updated 2 years ago
Alternatives and similar repositories for treeminhash:
Users that are interested in treeminhash are comparing it to the libraries listed below
- BagMinHash - Minwise Hashing Algorithm for Weighted Sets☆26Updated 4 years ago
- ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity☆42Updated 4 years ago
- SetSketch: Filling the Gap between MinHash and HyperLogLog☆49Updated 3 years ago
- DartMinHash: Fast Sketching for Weighted Sets☆13Updated 3 years ago
- SRS - Fast Approximate Nearest Neighbor Search in High Dimensional Euclidean Space With a Tiny Index☆55Updated 9 years ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness.☆11Updated 2 years ago
- Compact Tree Representation☆16Updated 7 years ago
- Optimal partitioning of Variable-Byte codes for better compression and fast decoding.☆17Updated 3 years ago
- A C++ library to benchmark inverted indexes.☆20Updated 4 years ago
- FOundation of stXXl and thriLL☆14Updated last year
- Parallel Wavelet Tree and Wavelet Matrix Construction☆26Updated last year
- similarity join and search algorithms for edit distance and jaccard☆18Updated 7 years ago
- A locality-sensitive hashing library☆46Updated 10 years ago
- Fast C++ implementation of https://github.com/yahoo/lopq: Locally Optimized Product Quantization (LOPQ) model and searcher for approximat…☆34Updated 4 years ago
- Succinct C++☆25Updated 4 years ago
- This program implements the following graph reordering technique: Laxman Dhulipala, Igor Kabiljo, Brian Karrer, Giuseppe Ottaviano, Serg…☆10Updated 6 years ago
- Online construction of run-length BWT (RLBWT) and r-index. Plus, online LZ77 parsing based on RLBWT.☆15Updated 6 years ago
- Code used for the experiments in the paper "Partitioned Elias-Fano Indexes"☆39Updated 10 years ago
- Parameterless and Universal FInding of Nearest Neighbors☆57Updated 8 months ago
- Paper about the estimation of cardinalities from HyperLogLog sketches☆61Updated 3 years ago
- Interleaving bits from two sources using SIMD instructions.☆14Updated 6 years ago
- 🔶 Compressed bitvector/container supporting efficient random access and rank queries☆43Updated 4 months ago
- Fast implementations of the scancount algorithm: C++ header-only library☆26Updated 5 years ago
- Wavelet tree based on a fixed block boosting technique☆16Updated 3 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing☆19Updated last year
- A SIMD-based C++ library providing rank/select queries over mutable bitmaps.☆35Updated 2 years ago
- C++ Implementation of Zip Trees☆14Updated 2 years ago
- High Dimensional Approximate Near(est) Neighbor☆33Updated 7 years ago
- Source code for the Shifted Hamming Distance (SHD) filtering mechanism for sequence alignment. Described in the Bioinformatics journal pa…☆32Updated 4 years ago