Macielyoung / Confused_Chinese
Fetching confused chars, including same pronunciation, similar pronunciation and similar character pattern
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Confused_Chinese
- 基于T5模型的中文文本纠错☆25Updated last week
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Updated last year
- pytorch版unilm模型☆25Updated 3 years ago
- Dynamic Connected Networks for Chinese Spelling Check☆49Updated 7 months ago
- 基于seq2edit (Gector) 的中文文本纠错。☆26Updated last year
- bert_avg,bert_whitening,sbert,consert,simcse,esimcse 中文句向量表示☆16Updated 2 years ago
- FinCUGE Instruction dataset☆10Updated last year
- source code of EMNLP2021: A Lightweight Pretrained Model for Chinese Spelling Check☆13Updated 3 years ago
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆72Updated last year
- CCL 2023 汉语学习者文本纠错评测☆26Updated last year
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆38Updated 2 years ago
- 基于模板的文本纠错;Automatically Mining Error Templates for Grammatical Error Correction☆37Updated 2 years ago
- 科大讯飞低资源多语种文本翻译挑战赛获奖方案☆27Updated last year
- ☆10Updated 2 years ago
- Finetune t5 and bart on Chinese Grammatical Error Correction data.☆16Updated 2 years ago
- Chinese Grammatical Error Diagnosis☆11Updated 3 years ago
- ☆57Updated last year
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆108Updated 3 months ago
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".☆67Updated 3 years ago
- The code for our ACL2022 findings paper: CRACSpell: A Contextual Typo Robust Approach with Copy Mechanism to Improve Chinese Spelling Cor…☆72Updated 2 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Updated 11 months ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Updated 2 years ago
- using lear to do ner extraction☆29Updated 2 years ago
- ☆45Updated 11 months ago
- ☆126Updated 2 years ago
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆80Updated 8 months ago
- 基于 onnxruntime 推理引擎的中文 ltp 词法分析☆13Updated 2 years ago
- ☆52Updated 8 months ago
- ☆46Updated 3 years ago
- 格物-多语言和中文大规模预训练模型-轻量版,涵盖纯中文、知识增强、113个语种多语言,采用主流Roberta架构,适用于NLU和NLG任务, 支持pytorch、tensorflow、uer、huggingface等框架。 Multilingual and Chinese …☆25Updated last year