LiangsLi / ChineseHomophones
Chinese homophone word/character lexicon (same-pronunciation words/characters). Chinese Homophones
☆105 Updated 5 years ago
Alternatives and similar repositories for ChineseHomophones
Users that are interested in ChineseHomophones are comparing it to the libraries listed below
- Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021 ☆235 Updated 2 years ago
- ☆127 Updated 2 years ago
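- SIGHAN Chinese spelling correction datasets and their converted formats ☆64 Updated 5 years ago
- Blog of 李傲龍 ☆81 Updated 10 months ago
- Introductory article on dialogue rewriting ☆97 Updated last year
- Data preprocessing for question-equivalence judgment, including adding adversarial samples (homophone and near-synonym substitution, etc.) and extracting sample patterns (replacing shared words with wildcards, extracting shared and differing words) ☆39 Updated 5 years ago
- Evaluating the fluency of natural language ☆115 Updated 3 years ago
- Chinese character feature extraction tool; extracts pronunciation (initial, final, tone), glyph (components, radical), four-corner code, and other features, which can also be fed into a model as tensors ☆137 Updated 5 years ago
- Python | Efficient use of the statistical language model kenlm: new word discovery, word segmentation, intelligent error correction, etc. ☆164 Updated 5 years ago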
- This is the official code for the paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models". ☆68 Updated 4 years ago
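- BERT-based unsupervised word segmentation and syntactic parsing ☆110 Updated 4 years ago
- CCL 2022 evaluation on text correction for Chinese learners ☆141 Updated 2 years ago
- MLM-based BERT pre-trained model for pinyin-to-Chinese-character conversion with error correction (pinyin corrector), implemented in PyTorch ☆45 Updated 4 years ago
- DistilBERT for Chinese: a distilled BERT model pre-trained on large-scale Chinese data ☆92 Updated 5 years ago
- Correcting Chinese Spelling Errors with Phonetic Pre-training (unofficial implementation) ☆40 Updated 3 years ago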
- Dynamic Connected Networks for Chinese Spelling Check ☆50 Updated last year
- This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check" ☆294 Updated 5 years ago
- The code for our ACL2022 findings paper: CRACSpell: A Contextual Typo Robust Approach with Copy Mechanism to Improve Chinese Spelling Cor… ☆75 Updated 3 years ago
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3 ☆183 Updated 4 years ago
- Chinese text correction based on BERT ☆235 Updated last year
- SpellGCN ☆252 Updated 4 years ago
- ☆267 Updated 10 months ago
- A Multi-modal Model Chinese Spell Checker Released on ACL2021. ☆159 Updated last year
- ☆52 Updated 4 years ago
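- ChineseTextualInference project, including Chinese corpus construction and an inference model: a Chinese textual inference project covering the translation and construction of an 880,000-example Chinese textual entailment dataset and a deep-learning-based textual entailment classification model… ☆172 Updated 6 years ago
- ☆41 Updated 5 years ago
- Chinese NLP datasets ☆153 Updated 5 years ago
- Time expression extraction, conversion, and normalization for Chinese text ☆51 Updated 4 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC). ☆55 Updated 3 years ago
- Unilm for Chinese Chitchat Robot. A compliment-style (夸夸) chitchat bot project based on the Unilm model. ☆157 Updated 4 years ago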