17zuoye / detdupLinks
Detect duplicated items。内容排重框架。
☆11Updated 10 years ago
Alternatives and similar repositories for detdup
Users that are interested in detdup are comparing it to the libraries listed below
Sorting:
- tyccl(同义词词林) is a ruby gem that provides friendly functions to analyse similarity between Chinese Words.☆46Updated 11 years ago
- A Python package for pullword.com☆86Updated 5 years ago
- Pure python NLP toolkit☆55Updated 9 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- Distributed text analysis suite based on Celery☆96Updated 2 years ago
- Chinese Words Segment Library based on HMM model☆166Updated 11 years ago
- a text analyzing (match, rewrite, extract) engine (python edition)☆80Updated 8 years ago
- 把之前 hanLP-python-flask 裡面的 hanLP 單獨分出來☆59Updated 7 years ago
- a chinese segment base on crf☆234Updated 6 years ago
- 中文自然语言处理工具包☆86Updated 10 years ago
- yaha☆267Updated 6 years ago
- ☆99Updated 11 years ago
- A distributed Sina Weibo Search spider base on Scrapy and Redis.☆144Updated 12 years ago
- A Chinese Words Segmentation Tool Based on Bayes Model☆79Updated 12 years ago
- rmmseg-cpp with Python interface☆189Updated 11 years ago
- Caver: a toolkit for multilabel text classification.☆39Updated 6 years ago
- auto generate chinese words in huge text.☆92Updated 10 years ago
- ☆68Updated 10 years ago
- autocomplete-redis is a quora like automatic autocompletion based on redis.☆204Updated 11 years ago
- scikit-learn-doc-zh☆16Updated 9 years ago
- ☆56Updated 9 years ago
- Chinese Tokenizer; New words Finder. 中文三段式机械分词算法; 未登录新词发现算法☆95Updated 8 years ago
- Detect duplicated items framework。内容排重框架。☆12Updated 10 years ago
- Academic Search Engine using Scrapy, MongoDB, Lucene/Solr, Tika, Struts2, Jquery, Bootstrap, D3, CAS☆100Updated 12 years ago
- convert sogou input dict ( .scel file ) to mmseg(coreseek) dict☆96Updated 12 years ago
- BosonNLP HTTP API 封装库(SDK)☆163Updated 6 years ago
- Lean Semantic Web tutorials☆128Updated 11 years ago
- Chinese Natural Language Processing tools and examples☆162Updated 9 years ago
- 开源中文分词工具包,中文分词Web API,Lucene中文分词,中英文混合分词☆43Updated 4 years ago
- Chinese Synonym Library☆123Updated 7 years ago