lqfeng / ChineseCharacters
中文繁体和简体字符对照表
☆39Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ChineseCharacters
- 拼音转汉字, convert pinyin to 汉字 using deep networks☆22Updated 4 years ago
- simple-pinyin 基于隐马尔可夫模型的简易拼音输入法(拼音转汉字)☆43Updated 2 months ago
- CCL 2023 汉语学习者文本纠错评测☆26Updated last year
- 大规模中文语料☆38Updated 5 years ago
- 中文纠错☆91Updated 2 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆128Updated 4 years ago
- 使用 pinyin-data 和 phrase-pinyin-data 中的拼音数据文件覆盖 pypinyin 中的内置拼音数据☆44Updated 8 months ago
- 汉字自动拆分系统开发☆102Updated last year
- 渊 - A project for Classical Chinese☆94Updated 2 years ago
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆38Updated 2 years ago
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆35Updated last year
- 基于Pytorch 1.0 实现的中文断句与标点符号恢复。☆55Updated 5 years ago
- 《现代汉语大词典》字词头☆26Updated 3 years ago
- ☆29Updated 5 months ago
- ☆22Updated 4 years ago
- Use bert to predict punctuation on IWSLT2012 and The People's Daily 2014☆65Updated 4 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆107Updated 3 months ago
- Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。☆69Updated 4 years ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"☆22Updated 2 years ago
- 词语拼音数据☆454Updated 8 months ago
- 古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆46Updated last year
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".☆68Updated 3 years ago
- The case study and multilingfual performance of ICASSP submission☆19Updated 2 years ago
- repo for Tibetan corpora☆21Updated last year
- 汉字形近字分布☆13Updated 2 years ago
- 基于mlm方式的带有纠错功能的拼音转汉字bert预训练模型,pinyin correcter,基于pytorch框架实现☆45Updated 4 years ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆45Updated 5 months ago
- 基于模板的文本纠错;Automatically Mining Error Templates for Grammatical Error Correction☆37Updated 2 years ago
- python | 高效使用统计语言模型kenlm:新词发现、分词、智能纠错等☆163Updated 5 years ago