A simple implementation of the simhash algorithm in Java.
☆155Oct 10, 2020Updated 5 years ago
Alternatives and similar repositories for simhash-java
Users that are interested in simhash-java are comparing it to the libraries listed below.
- Single-machine Java implementation of Simhash☆116May 20, 2022Updated 3 years ago
- ☆23Nov 5, 2017Updated 8 years ago
- Text retrieval database based on simhash similarity search☆26Mar 27, 2023Updated 3 years ago
- Java implementation for MinHash and LSH for finding near duplicate documents as measured by Jaccard similarity.☆33Mar 30, 2015Updated 11 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Mar 16, 2017Updated 9 years ago
- This provides tools for the b-bit MinHash algorithm.☆39Nov 21, 2025Updated 5 months ago
- Easy-to-use Java library for similarity checking of strings or numeric-series☆20Jan 23, 2020Updated 6 years ago
- Simple example of Java API☆20Aug 9, 2021Updated 4 years ago
- Free-to-use plugins made for SRPG Studio.☆12Apr 25, 2026Updated 3 weeks ago
- Open Source Implementation of Simhash in Python☆24Sep 14, 2017Updated 8 years ago
- Detecting duplicate Quora questions☆19Apr 5, 2017Updated 9 years ago
- Filters spam SMS messages using machine learning, in a Bayesian form.☆16Mar 9, 2017Updated 9 years ago
- A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It…☆203Jul 26, 2020Updated 5 years ago
- A Simple Flickr App Using NativeScript☆13Jun 6, 2015Updated 10 years ago
- Uses the simhash algorithm to quickly index and query large numbers of text résumés☆21Dec 16, 2015Updated 10 years ago
- A Java implementation of doc2vec in ICML'14☆30Jul 23, 2015Updated 10 years ago
- SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex☆19Nov 18, 2022Updated 3 years ago
- 2018 graduate student roommate recommendation system (Roommate Matching): a simple small app that helps students find roommates with similar habits☆11Apr 3, 2019Updated 7 years ago
- Based on spring-cache, integrates a local cache [ehcache] and a distributed cache [redis] to form a two-level cache.☆11May 26, 2023Updated 2 years ago
- Import Wikidata into Neo4j☆27Jan 24, 2016Updated 10 years ago
- view CAD paper on Android☆11May 24, 2012Updated 13 years ago
- ☆25Mar 22, 2013Updated 13 years ago
- ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both exceed the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries☆6,534Nov 19, 2023Updated 2 years ago
- The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)☆225Dec 22, 2022Updated 3 years ago
- Integrates Spring and Elasticsearch to build search☆24Dec 16, 2022Updated 3 years ago
- simple simhashing in hadoop with cascading☆33May 9, 2011Updated 15 years ago
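- Document deduplication addresses the problem of semantically duplicate documents in search engines, using a semantic fingerprint algorithm based on multiple hashing.☆11Aug 17, 2013Updated 12 years ago
- Chinese version of textteaser☆11Jun 2, 2018Updated 7 years ago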
- A LaTeX package cocktail for grad school level writing/presentation☆13Feb 11, 2021Updated 5 years ago
- Automatically exported from code.google.com/p/jbirch☆12Sep 6, 2022Updated 3 years ago
- ☆61Jul 19, 2024Updated last year
- ☆11May 16, 2022Updated 4 years ago
- Code for the paper Faster Phrase-Based Decoding by Refining Feature State☆14Jan 9, 2023Updated 3 years ago
- Mirror of 0.1.1 release of clausie from http://www.mpi-inf.mpg.de/departments/databases-and-information-systems/software/clausie/☆14Jan 4, 2015Updated 11 years ago
- ☆13Sep 6, 2016Updated 9 years ago
- carCV☆19Feb 6, 2014Updated 12 years ago
- Simple Java library for transforming an Object to another Object☆12Aug 4, 2025Updated 9 months ago
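- License plate recognition toll system: an open-source Android license plate recognition project that uses the open-source EasyPR framework as its core recognition library. This project is for learning purposes only.☆15Mar 25, 2017Updated 9 years ago
- Jcseg is a lightweight NLP framework developed in Java. Provides CJK and English segmentation based on the MMSEG algorithm, with also keywo…☆921Sep 18, 2023Updated 2 years ago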