StuPeter / Sougou_dict_spider
Sogou lexicon crawler: downloads all categories, classifies entries automatically, and converts scel to txt
☆208 Updated 11 months ago
Alternatives and similar repositories for Sougou_dict_spider:
Users who are interested in Sougou_dict_spider are comparing it to the libraries listed below:
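- THUOCL (THU Open Chinese Lexicon), a Chinese lexicon ☆907 Updated last year
- A dict for Google Pinyin Input, generated from Sogou cell lexicons exported from Sogou Pinyin Input. ☆63 Updated 8 years ago
- Preprocessed Chinese corpus ☆107 Updated 6 years ago
- Full text of the Contemporary Chinese Dictionary (《现代汉语词典》, 7th edition) in TXT ☆264 Updated 9 months ago
- Simplified Chinese lexicon with word frequencies and phonetic annotations; the special-symbol lexicon covers Greek letters, some mathematical symbols, emoji, ordinal markers, and more. ☆76 Updated 2 years ago
- Development of an automatic Chinese character decomposition system ☆102 Updated last year
- Pinyin-to-Chinese-character conversion, pinyin input method engine, pin yin -> 拼音 ☆605 Updated 9 months ago
- Wubi (五笔字型) encoding database for a very large character set ☆89 Updated 2 years ago
- Python crawler that downloads lexicon files for the Sogou, Baidu, and QQ input methods; can be used to build vocabularies for different industries ☆113 Updated 7 years ago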
- This is a corpus of Chinese abbreviations, including negative full forms. ☆194 Updated 3 years ago
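- Chinese character dataset with per-character information such as stroke count, radical, pinyin, and English definitions/synonyms. ☆115 Updated 4 years ago
- Chinese character to Wubi conversion tool ☆33 Updated 6 years ago
- Classical Chinese (文言文) dictionary: crawls the Classical Chinese dictionary site (文言文字典网) to build a Kindle dictionary. ☆66 Updated 6 years ago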
- ☆58 Updated 7 years ago
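- The most complete Chinese dictionaries ever. A categorized Chinese lexicon spanning 12 major categories: geographic information, video games, engineering applications, agriculture/forestry/animal husbandry/fishery, humanities, social sciences, everyday-life encyclopedia, medicine and pharmacy, art and design, entertainment and leisure, sports and leisure, and natural sciences. ☆76 Updated 4 years ago
- Pinyin data for words ☆475 Updated 2 months ago
- Synonym list, antonym list, and negation-word list ☆527 Updated 5 months ago
- Performance benchmark of major Chinese word segmenters ☆157 Updated 6 years ago
- DomainWordsDict, a Chinese word dictionary covering more than 68 domains, which can be used for text classification and knowledge enhancement tasks.… ☆683 Updated 3 years ago
- Curated Chinese Wikipedia corpus ☆295 Updated 7 years ago
- Chinese-related dictionaries and corpora. ☆172 Updated 10 years ago
- Chinese corpus, automatically updated daily: corpus files ☆145 Updated 4 years ago
- Hanzi decomposition library that breaks Chinese characters into radicals, for use as glyph features of Chinese characters in machine learning | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components… ☆365 Updated 5 months ago
- Type without paging through candidates: a Rime input method schema using double pinyin (双拼) plus auxiliary codes ☆138 Updated 2 weeks ago
- Table of General Standard Chinese Characters (《通用规范汉字表》) + phonetic annotations + Rime character table ☆47 Updated 2 years ago
- 📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more) ☆706 Updated 3 months ago
- colordict dictionary collection ☆86 Updated 10 years ago
- Repository of classical Chinese texts ☆272 Updated 7 years ago
- Backup and sharing of a personal Trime (同文) configuration ☆49 Updated last year
- Chinese character decomposition dictionary ☆766 Updated 2 years ago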