lqfeng / ChineseCharacters
Mapping table of Traditional and Simplified Chinese characters
☆45 Updated 3 months ago
Alternatives and similar repositories for ChineseCharacters:
Users that are interested in ChineseCharacters are comparing it to the libraries listed below
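- Development of an automatic Chinese character decomposition system ☆102 Updated last year
- Chinese character dataset with related information for each character, such as stroke count, radical, pinyin, and English definitions/synonyms. ☆118 Updated 4 years ago
- Character and word headwords from 《现代汉语大词典》 ☆26 Updated 4 years ago
- Full text of 《现代汉语词典》 (7th edition) in TXT format ☆267 Updated 10 months ago
- Pinyin data for words and phrases ☆480 Updated 3 weeks ago
- Chinese sentence segmentation and punctuation restoration, implemented in PyTorch 1.0. ☆58 Updated 6 years ago
- python | Efficient use of the statistical language model kenlm: new-word discovery, word segmentation, intelligent error correction, and more ☆164 Updated 5 years ago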
- ☆36 Updated 11 months ago
- Overrides the built-in pinyin data in pypinyin with the pinyin data files from pinyin-data and phrase-pinyin-data ☆57 Updated 3 months ago
- Use BERT to predict punctuation on IWSLT2012 and The People's Daily 2014 ☆66 Updated 4 years ago
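- MLM-based pretrained BERT model for pinyin-to-Chinese-character conversion with error correction (pinyin correcter), implemented in PyTorch ☆45 Updated 4 years ago
- Chinese character feature extraction tool that extracts pronunciation (initial, final, tone), glyph (components, radical), four-corner code, and other features from characters; the features can also be fed into a model as tensors ☆137 Updated 4 years ago
- Data and tools for the Chinese four-corner method (四角号码), which decomposes Chinese characters into glyph-related codes for use as glyph features in machine learning ☆26 Updated 5 years ago
- The best tool for converting Chinese numerals to Arabic numerals. Covers many Chinese expressions such as "点二八" (point two eight) and "负百分之四十" (negative forty percent). A must-have for NLP and robotics engineering! The Best Tool of Chinese Number to Digits ☆365 Updated 2 years ago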
- repo for Tibetan corpora ☆21 Updated 2 years ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset" ☆23 Updated 2 years ago
- SpellGCN ☆252 Updated 4 years ago
- 渊 - A project for Classical Chinese ☆104 Updated 3 years ago
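- Hanzi decomposition library that breaks Chinese characters down into radicals and components, for use as glyph features in machine learning | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components… ☆374 Updated 6 months ago
- Parallel corpus for Classical-to-Modern Chinese translation ☆104 Updated 3 years ago
- SIGHAN Chinese spelling-correction datasets, with converted formats ☆64 Updated 5 years ago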
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words. ☆37 Updated last year
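- Chinese character feature extractor (featurizer) that extracts character features (pronunciation and glyph features) for use in deep learning | A Chinese character feature extractor, which extracts the features of Chinese charac… ☆295 Updated 4 years ago
- Early Modern Chinese corpus dataset: natural language processing, corpus, ancient Chinese, Classical Chinese, digital humanities, computational linguistics ☆157 Updated 2 months ago
- Self-implemented Pinyin2Chinese demo using algorithms including a Trie and an HMM; a simple demo of pinyin segmentation and pinyin-to-Chinese conversion based on a hidden Markov model and a Trie. ☆86 Updated 7 years ago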
- IDS data for CJK Unified Ideographs ☆435 Updated 2 years ago
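- Convert pinyin to Chinese characters (汉字) using deep networks ☆22 Updated 4 years ago
- Compares 6,700 commonly used Chinese characters by sound and shape, and outputs lists of similar-sounding and similar-looking characters. # Similar characters ☆458 Updated last year
- WordForm: an interface for stroke decomposition, radical lookup, and pinyin conversion of Chinese words ☆65 Updated 6 years ago
- Simplified Chinese lexicon with word frequencies and phonetic annotations; the special-symbol lexicon includes Greek letters, some mathematical symbols, emoji, ordinal markers, and more. ☆78 Updated 2 years ago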