Simhashing in C++
☆136Feb 14, 2023Updated 3 years ago
Alternatives and similar repositories for simhash-cpp
Users that are interested in simhash-cpp are comparing it to the libraries listed below
Sorting:
- Simhash and near-duplicate detection☆424May 15, 2023Updated 2 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Mar 16, 2017Updated 9 years ago
- A Python Implementation of Simhash Algorithm☆1,036Mar 24, 2022Updated 3 years ago
- An efficient simhash implementation for python☆128Oct 25, 2019Updated 6 years ago
- A cluster implementation of simhash near-duplicate detection☆32Mar 11, 2015Updated 11 years ago
- ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity☆44Oct 26, 2020Updated 5 years ago
- Compact Tree Representation☆16Mar 16, 2017Updated 9 years ago
- Rust implementation of probminhash, superminhash and hyperloglog sketching algorithms☆31Jan 22, 2026Updated last month
- A locality-sensitive hashing library☆44Feb 21, 2014Updated 12 years ago
- compressed, queryable variation graphs☆11Jun 25, 2015Updated 10 years ago
- Daichi Amagata and Takahiro Hara, SIGMOD2021☆15Apr 10, 2024Updated last year
- Relative data structures based on the BWT☆12Apr 28, 2018Updated 7 years ago
- Grail+ is a set of command line tools for manipulating non-deterministic finite automata (NFAs), non-deterministic pushdown automata (PDA…☆10Oct 22, 2016Updated 9 years ago
- ☆18Jul 9, 2018Updated 7 years ago
- A project for clustering text streams using locality-sensitive hashing (LSH) in Python☆26Sep 23, 2011Updated 14 years ago
- linear time suffix array construction algorithm☆12Oct 28, 2016Updated 9 years ago
- A HyperLogLog implementation in Rust.☆52Feb 9, 2026Updated last month
- C Application Framework☆101Oct 2, 2020Updated 5 years ago
- A Go library for space-efficient rank/select operations for both sparse and dense bit arrays.☆38Jul 24, 2020Updated 5 years ago
- Implementation of the data structures described in the paper "Fast Compressed Tries using Path Decomposition".☆58Jan 27, 2023Updated 3 years ago
- Weighted MinHash implementation on CUDA (multi-gpu).☆121Nov 29, 2023Updated 2 years ago
- Passive Bitcoin Project☆10Aug 10, 2015Updated 10 years ago
- Bioinformatics 101 tool for counting unique k-length substrings in DNA☆33Feb 17, 2026Updated last month
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents☆293Jun 11, 2023Updated 2 years ago
- HyperLogLog implementations.☆28Aug 11, 2024Updated last year
- A simple kickstarter clone, for internal project bounties and backing.☆39May 8, 2015Updated 10 years ago
- FLECC_IN_C is a FLexible Elliptic Curve Cryptography library written IN C☆18Nov 17, 2017Updated 8 years ago
- Compute strain abundance in a defined microbial community☆10Jul 27, 2023Updated 2 years ago
- C library for efficient string matching with Aho-Corasick☆21Jan 20, 2012Updated 14 years ago
- Implementation of ip-nsw from Non-metric Similarity Graphs for Maximum Inner Product Search☆40Sep 17, 2018Updated 7 years ago
- TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation☆17Aug 3, 2025Updated 7 months ago
- Simple and fast MinHash implementation in C with Python wrapper☆13Jul 24, 2025Updated 7 months ago
- For bluntifying overlapped GFAs☆13Jul 26, 2024Updated last year
- Filter of Pairwise Alignement☆44Jan 31, 2022Updated 4 years ago
- Automatically exported from code.google.com/p/google-concurrency-library☆15Sep 17, 2015Updated 10 years ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness.☆15Jun 28, 2022Updated 3 years ago
- Parallel Wavelet Tree and Wavelet Matrix Construction☆25Jun 27, 2023Updated 2 years ago
- BagMinHash - Minwise Hashing Algorithm for Weighted Sets☆26Aug 26, 2020Updated 5 years ago
- Browser-based annotation tool for Framenet☆16Jan 27, 2015Updated 11 years ago