oertl / treeminhash
TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation
☆14 · Updated 2 years ago
Alternatives and similar repositories for treeminhash:
Users interested in treeminhash are comparing it to the libraries listed below.
- BagMinHash - Minwise Hashing Algorithm for Weighted Sets ☆26 · Updated 4 years ago
- ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity ☆42 · Updated 4 years ago
- SetSketch: Filling the Gap between MinHash and HyperLogLog ☆49 · Updated 3 years ago
- DartMinHash: Fast Sketching for Weighted Sets ☆13 · Updated 4 years ago
- Compact Tree Representation ☆16 · Updated 8 years ago
- SRS - Fast Approximate Nearest Neighbor Search in High Dimensional Euclidean Space With a Tiny Index ☆55 · Updated 9 years ago
- 🔶 Compressed bitvector/container supporting efficient random access and rank queries ☆43 · Updated 7 months ago
- This program implements the following graph reordering technique: Laxman Dhulipala, Igor Kabiljo, Brian Karrer, Giuseppe Ottaviano, Serg… ☆10 · Updated 6 years ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness. ☆11 · Updated 2 years ago
- Optimal partitioning of Variable-Byte codes for better compression and fast decoding. ☆17 · Updated 3 years ago
- A C++ library for summarizing data streams ☆24 · Updated 5 years ago
- Parameterless and Universal FInding of Nearest Neighbors ☆59 · Updated 3 weeks ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing ☆22 · Updated last year
- Parallel Suffix Array, LCP Array, and Suffix Tree Construction ☆49 · Updated 5 years ago
- Fast C++ implementation of https://github.com/yahoo/lopq: Locally Optimized Product Quantization (LOPQ) model and searcher for approximat… ☆34 · Updated 4 years ago
- A locality-sensitive hashing library ☆46 · Updated 11 years ago
- FOundation of stXXl and thriLL ☆14 · Updated last year
- Interleaving bits from two sources using SIMD instructions. ☆14 · Updated 7 years ago
- Online construction of run-length BWT (RLBWT) and r-index. Plus, online LZ77 parsing based on RLBWT. ☆15 · Updated 6 years ago
- Wavelet tree based on a fixed block boosting technique ☆16 · Updated 3 years ago
- A C++ library to benchmark inverted indexes. ☆20 · Updated 4 years ago
- C library to compute the Hamming weight of arrays ☆45 · Updated 6 years ago
- Similarity join and search algorithms for edit distance and Jaccard ☆18 · Updated 7 years ago
- Daichi Amagata and Takahiro Hara, SIGMOD2021 ☆14 · Updated last year
- C++ Library implementing Compressed String Dictionaries ☆46 · Updated 2 years ago
- C++ implementations of indexing mechanisms, including a Hilbert-curve geohash based spatial index and a linear hashing table, for disk or… ☆76 · Updated 4 years ago
- A framework for index based similarity search. ☆19 · Updated 5 years ago
- Suite of universal indexes for Highly Repetitive Document Collections ☆20 · Updated 4 years ago
- C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings ☆152 · Updated 8 months ago
- C++ Implementation of Zip Trees ☆14 · Updated 2 years ago