xueyouluo / wiki-error-extract
Extracts an error-correction corpus from Wikipedia revision history data.
☆12Updated 2 years ago
Alternatives and similar repositories for wiki-error-extract:
Users that are interested in wiki-error-extract are comparing it to the libraries listed below
- CGED & CSC☆22Updated 4 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Updated 2 years ago
- Dynamic Connected Networks for Chinese Spelling Check☆50Updated 9 months ago
- CTC2021: SOTA solution and online demo for the Chinese Text Correction competition☆72Updated last year
- Conversational Word Embedding for Retrieval-based Dialog System (ACL2020)☆30Updated 4 years ago
- PyTorch version of the UniLM model☆26Updated 3 years ago
- source code of EMNLP2021: A Lightweight Pretrained Model for Chinese Spelling Check☆14Updated 3 years ago
- ☆12Updated 3 years ago
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".☆67Updated 3 years ago
- Dataset and Baseline for SMP-MCC2020☆23Updated last year
- Code of zlyang's master dissertation for Chinese grammatical error correction.☆34Updated 5 years ago
- using lear to do ner extraction☆29Updated 2 years ago
- A PyTorch implementation for "Hierarchical Attention Network with Pairwise Loss for Chinese Zero Pronoun Resolution" (AAAI 2020).☆9Updated 4 years ago
- KenLM language model, with a Python REST service☆29Updated 6 years ago
- This repository is for the paper "Confusionset-guided Pointer Networks for Chinese Spelling Check"☆58Updated 5 years ago
- Template-based text correction; Automatically Mining Error Templates for Grammatical Error Correction☆37Updated 2 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆65Updated 4 years ago
- Keyword extraction project☆24Updated 4 years ago
- CCL 2022 Chinese Learner Text Correction evaluation☆138Updated 2 years ago
- ☆59Updated 5 years ago
- Datasets and codes for the paper "RiSAWOZ: A Large-Scale Multi-Domain Wizard-of-Oz Dataset with Rich Semantic Annotations for Task-Orient…☆63Updated 2 years ago
- SIGHAN Chinese error-correction datasets, with converted formats☆63Updated 4 years ago
- A grammatical error correction reading list maintained by BLCU ICALL Research Group☆46Updated 2 years ago
- CLUEWSC2020: Chinese version of the Winograd Schema Challenge (WSC), a Chinese coreference resolution task☆71Updated 4 years ago
- ☆126Updated 2 years ago
- A large-scale cleaned Chinese chitchat corpus and Chinese dialogpt models☆34Updated 4 years ago
- This is the repository for NLPCC2020 task AutoIE☆51Updated 4 years ago
- Chinese pretrained UniLM model☆83Updated 3 years ago