archerhu / scel2mmseg
Convert Sogou input dict (.scel file) to mmseg (coreseek) dict
☆97 Updated 11 years ago
Alternatives and similar repositories for scel2mmseg:
Users who are interested in scel2mmseg are comparing it to the libraries listed below.
- A Chinese lexicon ☆347 Updated 10 years ago
- auto generate Chinese words in huge text. ☆91 Updated 10 years ago
- Convert Sogou input dict (.scel file) to mmseg (coreseek) dict ☆46 Updated 3 weeks ago
- BosonNLP HTTP API wrapper library (SDK) ☆163 Updated 6 years ago
- A Python package for pullword.com ☆86 Updated 4 years ago
- yaha ☆266 Updated 6 years ago
- the Chinese NLP full stack toolkit ☆41 Updated 10 years ago
- General-purpose web page main-text (and image) extraction based on the line-block distribution function - Python version ☆115 Updated 8 years ago
- ☆59 Updated 9 months ago
- Chinese Words Segment Library based on HMM model ☆166 Updated 10 years ago
- BosonNLP Analysis for ElasticSearch ☆102 Updated 8 years ago
- ☆99 Updated 11 years ago
- A readability parser which can extract title, content, images from html pages ☆87 Updated 4 years ago
- a Chinese segmenter based on CRF ☆233 Updated 6 years ago
- a bot for paperweekly ☆30 Updated 7 years ago
- A Python crawler that downloads dictionary files for the Sogou, Baidu, and QQ input methods; can be used to build vocabularies for different industries ☆114 Updated 7 years ago
- Youzan spam content filtering tool ☆283 Updated 8 years ago
- rmmseg-cpp with Python interface ☆189 Updated 11 years ago
- autocomplete-redis is a Quora-like automatic autocompletion based on Redis. ☆204 Updated 11 years ago
- The Python implementation for looking up Chinese administrative divisions. ☆129 Updated 4 years ago
- hanLP split out on its own from the earlier hanLP-python-flask ☆59 Updated 7 years ago
- Recognize CAPTCHA generated by bilibili.com ☆116 Updated 9 years ago
- Fudan University's Chinese natural language toolkit ☆72 Updated 7 years ago
- Chinese Tokenizer; New words Finder. A three-stage mechanical Chinese word segmentation algorithm; an out-of-vocabulary new-word discovery algorithm ☆95 Updated 8 years ago
- An OCR Search Engine With Tesseract, Nutch, Solr, and PHP ☆112 Updated 6 years ago
- Chinese natural language processing toolkit ☆86 Updated 9 years ago
- clone of https://code.google.com/p/cx-extractor ☆40 Updated 11 years ago
- paperweekly's forum ☆117 Updated 8 years ago
- A Python implementation of "General Web Page Main-Text Extraction Based on the Line-Block Distribution Function" ☆30 Updated 10 years ago
- A spectrum analysis based music finder ☆107 Updated 9 years ago