guo-yong-zhi / CharMap
汉字形近字分布
☆13Updated 3 years ago
Alternatives and similar repositories for CharMap:
Users that are interested in CharMap are comparing it to the libraries listed below
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆137Updated 4 years ago
- 错别字纠正算法。调用pycorrector接口,使用规则。☆68Updated 5 years ago
- 汉字字符特征提取器 (featurizer),提取汉字的特征(发音特征、字形特征)用做深度学习的特征 | A Chinese character feature extractor, which extracts the features of Chinese charac…☆293Updated 4 years ago
- 基于“音形码”的中文字符串相似度计算方法☆224Updated 4 years ago
- 对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字☆456Updated last year
- SpellGCN☆252Updated 4 years ago
- ☆23Updated 4 years ago
- Code for chinese error detection module, using n-gram and bi-lstm☆135Updated 6 years ago
- This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"☆294Updated 5 years ago
- 中文单词自动纠错☆121Updated 3 years ago
- 中文纠错☆92Updated 3 years ago
- 基于mlm方式的带有纠错功能的拼音转汉字bert预训练模型,pinyin correcter,基于pytorch框架实现☆45Updated 4 years ago
- python | 高效使用统计语言模型kenlm:新词发现、分词、智能纠错等☆164Updated 5 years ago
- 基于bert进行中文文本纠错☆234Updated last year
- WordForm,针对中文词语的笔画拆解,偏旁查询,拼音转换接口☆65Updated 6 years ago
- 中文谐音词/字库(同音词/字)Chinese Homophones☆103Updated 5 years ago
- 中文文本纠错模型,keras实现☆74Updated 3 years ago
- 利用语言模型,纠正OCR识别错误☆464Updated last year
- 用bert4keras加载CDial-GPT☆38Updated 4 years ago
- ☆127Updated 2 years ago
- 中文语料☆18Updated 6 years ago
- ☆86Updated 3 years ago
- ☆51Updated 4 years ago
- 李傲龍的博客☆81Updated 9 months ago
- 提取中文的偏旁部首和拼音(一些生僻字的拼音没有补全,待优化)☆43Updated 6 years ago
- ☆55Updated 3 years ago
- A Multi-modal Model Chinese Spell Checker Released on ACL2021.☆159Updated last year
- Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark☆129Updated last year
- 中文文本错别字检测以及自动纠错 / Autochecker & autocorrecter for chinese☆296Updated 7 years ago
- Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。☆73Updated 5 years ago