CreekLou / simhashLinks
An efficient algorithm for text similarity computation
☆61Updated 4 years ago
Alternatives and similar repositories for simhash
Users that are interested in simhash are comparing it to the libraries listed below
Sorting:
- A simple implementation of simhash algorithm by java.☆155Updated 4 years ago
- Simhash Java单机实现☆110Updated 3 years ago
- Chinese Word Segmentation Tool, THULAC的Java实现.☆84Updated 4 years ago
- a word2vec impl of Chinese language, based on deeplearning4j and ansj☆28Updated 4 years ago
- 自动抽取网页正 文的算法,用JAVA实现☆106Updated 8 years ago
- mltk web edition☆41Updated 9 years ago
- ☆23Updated 7 years ago
- The missing SVM-based text classification module implementing HanLP's interface☆47Updated 7 years ago
- FoolNLTK java version☆82Updated 6 years ago
- Tree-split 搬新家..给各位带来的不便深表歉意☆55Updated 8 years ago
- The implementation of bloomfilter with bit set of java and redis or others what is implemented by yourself.☆108Updated 6 years ago
- Document preprocessing for preparing formatted input data which is suitable for LibSVM tool.☆50Updated 8 years ago
- ltp4j: Language Technology Platform For Java☆161Updated 4 years ago
- 基于hanlp的elasticsearch分词插件☆157Updated 3 years ago
- Spider_SinaTweetCrawler, to crawl tweet content from sinaTweet. (java)