sfyc23 / python-wubi
汉字五笔转换工具
☆33Updated 6 years ago
Alternatives and similar repositories for python-wubi:
Users that are interested in python-wubi are comparing it to the libraries listed below
- 《现代汉语大词典》字词头☆26Updated 4 years ago
- worddict crawler and transfer for sougpuinput wordict , 搜狗输入法词库抓取与格式转换☆25Updated 7 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆196Updated 3 years ago
- 物种名称语料库。植物名,动物名。☆48Updated last year
- self complemented SpellCorrection based pinyin similairity, edit distance ,基于拼音相似度与编辑距离的查询纠错。☆82Updated 2 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆137Updated 4 years ago
- 图书名语料库。含部分电影、游戏名称。☆71Updated last year
- 搜狗细胞词库到普通文本的转换提取工具。提取词汇表,用于深度学习做数据生成和字典特征☆23Updated 6 years ago
- 转换搜狗拼音词库为txt文件☆50Updated 7 years ago
- 提取中文的偏旁部首和拼音(一些生僻字的拼音没有补全,待优化)☆43Updated 6 years ago
- 各大中文分词性能评测☆157Updated 6 years ago
- WordForm,针对中文词语的笔画拆解,偏旁查询,拼音转换接口☆65Updated 6 years ago
- 中文谐音词/字库(同音词/字)Chinese Homophones☆103Updated 5 years ago
- 常用的中文停用词表☆74Updated 7 years ago
- 新词发现 基于词频、凝聚系数和左右邻接信息熵☆122Updated 5 years ago
- colordict词典库☆86Updated 10 years ago
- 汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。☆118Updated 4 years ago
- 中文分词工具评估☆61Updated 2 years ago
- 汉字自动拆分系统开发☆102Updated last year
- chinese anti semantic word search interface based on dict crawled from online resources, ChineseAntiword,针对中文词语的反义词查询接口☆59Updated 6 years ago
- 新词发现算法(NewWordDetection)☆92Updated 4 years ago
- 成语数据 Chinese idiom data☆74Updated 7 years ago
- Quick run NLP in many task 快速运行分类、序列标注、匹配、生成等NLP任务的Tensorflow框架 (中文 NLP 支持分布式)☆30Updated 4 years ago
- Self complemented Pinyin2Chinese demo use algorithms including Trie and HMM model , 基于隐马尔科夫模型与Trie树的拼音切分与拼音转中文的简单demo实现。☆86Updated 7 years ago
- 中文纠错☆92Updated 3 years ago
- 金庸小说人物关系图谱构建☆61Updated 5 years ago
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆24Updated 6 years ago
- 汉字拆字库,可以将汉字拆解成偏旁部首,在机器学习中作为汉字的字形特征 | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components…☆374Updated 6 months ago
- Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。☆73Updated 5 years ago
- 由搜狗细胞词库生成的谷歌拼音输入法词典 A dict for Google Pinyin Input, exported from Sougou Pinyin Input.☆65Updated 8 years ago