pluto-junzeng / C4-zh
Large-scale Chinese corpus
☆42 · Updated 5 years ago
Alternatives and similar repositories for C4-zh
Users who are interested in C4-zh are comparing it to the repositories listed below
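- CTC2021: SOTA solution and online demo for the Chinese Text Correction competition ☆72 · Updated last year
- Zero-shot learning evaluation benchmark, Chinese version ☆56 · Updated 4 years ago
- Code implementation of Baichuan's Dynamic NTK-ALiBi: inference on longer texts without fine-tuning ☆47 · Updated last year
- LoRA fine-tuning of BLOOMZ, with reference to BELLE ☆25 · Updated 2 years ago
- Unofficial implementation of "Correcting Chinese Spelling Errors with Phonetic Pre-training" ☆40 · Updated 3 years ago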
- OPD: Chinese Open-Domain Pre-trained Dialogue Model ☆75 · Updated 2 years ago
- CLUEWSC2020: Chinese version of the Winograd Schema Challenge (WSC), a Chinese coreference resolution task ☆75 · Updated 5 years ago
- Zero-shot NLU & NLG based on the mengzi-t5-base-mt pretrained model ☆74 · Updated 2 years ago
- Code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers" ☆69 · Updated 10 months ago
- ☆53 · Updated 3 years ago
- Simple experiments with the P-tuning method on Chinese ☆139 · Updated 4 years ago
- MD5 links for a Chinese book corpus ☆218 · Updated last year
- Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension ☆167 · Updated 3 years ago
- NTK scaled version of ALiBi position encoding in Transformer. ☆68 · Updated last year
- CCL 2022 evaluation on text correction for Chinese language learners ☆141 · Updated 2 years ago
- OCNLI: Original Chinese Natural Language Inference task ☆156 · Updated 3 years ago
- Baseline for the Intelligent Text Proofreading Competition (Chinese Text Correction) ☆67 · Updated 2 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model ☆117 · Updated 6 months ago
- Chinese version of the unilm pretrained model ☆83 · Updated 4 years ago
- Datasets from past editions of the Chinese syntactic error diagnosis evaluation ☆42 · Updated 3 years ago
- Python toolkit for the Chinese Language Understanding Evaluation (CLUE) benchmark ☆129 · Updated 2 years ago
- Chinese machine reading comprehension dataset ☆103 · Updated 4 years ago
- Chinese instruction tuning datasets ☆132 · Updated last year
- Official code for the paper "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models" ☆68 · Updated 4 years ago
- ☆127 · Updated 2 years ago
- Chinese bigbird pretrained model ☆93 · Updated 2 years ago
- Upgraded version of RoFormer ☆152 · Updated 2 years ago
- Introductory article on dialogue rewriting ☆97 · Updated 2 years ago
- Datasets and code for the paper "RiSAWOZ: A Large-Scale Multi-Domain Wizard-of-Oz Dataset with Rich Semantic Annotations for Task-Orient… ☆64 · Updated 2 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task 2: Grammatical Error Correction (GEC). ☆55 · Updated 3 years ago