xueyouluo / wiki-error-extract
Extracts an error-correction corpus from Wikipedia's edit history data.
☆12 Updated 2 years ago
Related projects
Alternatives and complementary repositories for wiki-error-extract
- CGED & CSC ☆22 Updated 4 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task 2: Grammatical Error Correction (GEC). ☆56 Updated 2 years ago
- This is the official code for the paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models". ☆68 Updated 3 years ago
- Dynamic Connected Networks for Chinese Spelling Check ☆50 Updated 7 months ago
- CTC2021: SOTA solution and online demo for the Chinese Text Correction competition ☆72 Updated last year
- Source code of EMNLP2021: A Lightweight Pretrained Model for Chinese Spelling Check ☆14 Updated 3 years ago
- Template-based text correction; Automatically Mining Error Templates for Grammatical Error Correction ☆37 Updated 2 years ago
- PyTorch version of the UniLM model ☆25 Updated 3 years ago
- SIGHAN Chinese error-correction datasets, with converted formats ☆63 Updated 4 years ago
- Code of zlyang's master dissertation for Chinese grammatical error correction. ☆34 Updated 5 years ago
- A RoBERTa-wwm base distilled model, distilled from RoBERTa-wwm by RoBERTa-wwm-large ☆65 Updated 4 years ago
- ☆12 Updated 3 years ago
- CCL 2022 Chinese Learner Text Correction evaluation ☆135 Updated last year
- Keyword extraction project ☆24 Updated 4 years ago
- ☆20 Updated 4 years ago
- PyTorch version of Chinese multi-turn Cdial based on GPT + NEZHA ☆11 Updated 2 years ago
- Code for our EMNLP-19 paper, CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding ☆28 Updated 5 years ago
- KenLM language model, with a Python REST service ☆29 Updated 6 years ago
- Using LEAR for NER extraction ☆29 Updated 2 years ago
- 2021 Language and Intelligence Technology Competition: machine reading comprehension task ☆30 Updated 3 years ago
- Chinese text correction based on seq2edit (Gector). ☆26 Updated 2 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex… ☆14 Updated last year
- Code for "A Unified Model for Joint Chinese Word Segmentation and Dependency Parsing" ☆38 Updated 2 years ago
- ☆126 Updated 2 years ago
- This repo contains some experiments on text matching with the Chinese dataset LCQMC ☆27 Updated 4 years ago
- First-place solution for CGED-8, Track 2 of the CCL2022 Chinese Learner Text Correction evaluation task ☆52 Updated last year
- Documentation notes ☆15 Updated 3 years ago
- Some methods for unsupervised text generation ☆49 Updated 3 years ago