Simhashing in C++
☆136Feb 14, 2023Updated 3 years ago
Alternatives and similar repositories for simhash-cpp
Users that are interested in simhash-cpp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simhash and near-duplicate detection☆423May 15, 2023Updated 2 years ago
- A Python Implementation of Simhash Algorithm☆1,036Mar 24, 2022Updated 4 years ago
- An efficient simhash implementation for python☆128Oct 25, 2019Updated 6 years ago
- A cluster implementation of simhash near-duplicate detection☆32Mar 11, 2015Updated 11 years ago
- 中文文档simhash值计算☆1,168Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Compact Tree Representation☆16Mar 16, 2017Updated 9 years ago
- Rust implementation of probminhash, superminhash and hyperloglog sketching algorithms☆31Jan 22, 2026Updated 3 months ago
- A locality-sensitive hashing library☆44Feb 21, 2014Updated 12 years ago
- compressed, queryable variation graphs☆11Jun 25, 2015Updated 10 years ago
- Relative data structures based on the BWT☆12Apr 28, 2018Updated 8 years ago
- Grail+ is a set of command line tools for manipulating non-deterministic finite automata (NFAs), non-deterministic pushdown automata (PDA…☆10Oct 22, 2016Updated 9 years ago
- ☆18Jul 9, 2018Updated 7 years ago
- Daichi Amagata and Takahiro Hara, SIGMOD2021☆15Apr 10, 2024Updated 2 years ago
- linear time suffix array construction algorithm☆12Oct 28, 2016Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- C++ library to parse WARC files☆11Jan 27, 2019Updated 7 years ago
- C Application Framework☆101Oct 2, 2020Updated 5 years ago
- Implementation of the data structures described in the paper "Fast Compressed Tries using Path Decomposition".☆58Jan 27, 2023Updated 3 years ago
- Passive Bitcoin Project☆10Aug 10, 2015Updated 10 years ago
- Weighted MinHash implementation on CUDA (multi-gpu).☆122Nov 29, 2023Updated 2 years ago
- Bioinformatics 101 tool for counting unique k-length substrings in DNA☆33Feb 17, 2026Updated 2 months ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents☆291Jun 11, 2023Updated 2 years ago
- A simple kickstarter clone, for internal project bounties and backing.☆39May 8, 2015Updated 10 years ago
- FLECC_IN_C is a FLexible Elliptic Curve Cryptography library written IN C☆18Nov 17, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Mar 10, 2025Updated last year
- Scrapy project with spiders to extract article content from various german news sites☆21Sep 13, 2013Updated 12 years ago
- ☆12Dec 18, 2018Updated 7 years ago
- Compute strain abundance in a defined microbial community☆10Jul 27, 2023Updated 2 years ago
- C library for efficient string matching with Aho-Corasick☆21Jan 20, 2012Updated 14 years ago
- Repository for the Performance Interface eXtractor (PIX) tool presented at NSDI'22.☆17Jul 14, 2022Updated 3 years ago
- Implementation of ip-nsw from Non-metric Similarity Graphs for Maximum Inner Product Search☆41Sep 17, 2018Updated 7 years ago
- TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation☆17Aug 3, 2025Updated 9 months ago
- Simple and fast MinHash implementation in C with Python wrapper☆13Jul 24, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- For bluntifying overlapped GFAs☆13Jul 26, 2024Updated last year
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness.☆15Jun 28, 2022Updated 3 years ago
- Parallel Wavelet Tree and Wavelet Matrix Construction☆25Jun 27, 2023Updated 2 years ago
- BagMinHash - Minwise Hashing Algorithm for Weighted Sets☆26Aug 26, 2020Updated 5 years ago
- Extended HTTP Support☆18Aug 27, 2015Updated 10 years ago
- Recursive unified ORAM☆16Sep 23, 2015Updated 10 years ago
- Training/test data for Dragnet☆42Jan 29, 2015Updated 11 years ago