oertl / probminhash
ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity
☆42Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for probminhash
- DartMinHash: Fast Sketching for Weighted Sets☆13Updated 3 years ago
- SetSketch: Filling the Gap between MinHash and HyperLogLog☆49Updated 3 years ago
- COBS - Compact Bit-Sliced Signature Index (for Genomic k-Mer Data or q-Grams)☆83Updated 9 months ago
- A compressed, associative, exact, and weighted dictionary for k-mers.☆84Updated 3 weeks ago
- Implementation of a Backpack Quotient Filter☆10Updated 3 months ago
- BagMinHash - Minwise Hashing Algorithm for Weighted Sets☆26Updated 4 years ago
- Wavelet tree based on a fixed block boosting technique☆16Updated 3 years ago
- Simple and fast MinHash implementation in C with Python wrapper☆13Updated 4 years ago
- dynamic-updateable-index☆11Updated 9 years ago
- TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation☆14Updated last year
- An optimal space run-length Burrows-Wheeler transform full-text index☆58Updated last year
- Fast and compact locality-preserving minimal perfect hashing for k-mer sets.☆43Updated 11 months ago
- Relative data structures based on the BWT☆12Updated 6 years ago
- A tool for merging large BWTs☆26Updated 3 years ago
- ☆19Updated 4 years ago
- FM-Index full-text index implementation using RRR Wavelet trees (libcds) and fast suffix sorting (libdivsufsort) including experimental r…☆103Updated 9 years ago
- Fast and cache-efficient full-text pangenome index☆25Updated this week
- FM-index representation of a de Bruijn graph☆27Updated 7 years ago
- BWT Text Indexing Library: a set of tools to work with BWT-based text indexes☆25Updated 2 years ago
- An alignment-free, reference-free and incremental data structure for colored de Bruijn graph with application to pan-genome indexing.☆43Updated 2 years ago
- The BTL C/C++ Common bloom filters for bioinformatics projects, as well as any APIs created for other programming languages.☆18Updated 2 years ago
- R-Index-F Library for Pattern Matching☆13Updated 3 months ago
- C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings☆153Updated 3 months ago
- Online construction of run-length BWT (RLBWT) and r-index. Plus, online LZ77 parsing based on RLBWT.☆14Updated 6 years ago
- SIMD-parallel BLAST X-drop DP on sequence graphs☆24Updated 5 months ago
- Bit packed vector of integral values☆26Updated last year
- Ultra fast MSD radix sorter☆11Updated 4 years ago
- Smith-Waterman database searches with inter-sequence SIMD parallelisation☆58Updated last year
- Rust implementation of probminhash, superminhash and hyperloglog sketching algorithms☆23Updated last month
- Parallel Suffix Array, LCP Array, and Suffix Tree Construction☆48Updated 5 years ago