pluto-junzeng / C4-zh
大规模中文语料
☆40Updated 5 years ago
Alternatives and similar repositories for C4-zh:
Users that are interested in C4-zh are comparing it to the libraries listed below
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆72Updated last year
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- CLUEWSC2020: WSC Winograd模式挑战中文版,中文指代消解任务☆74Updated 4 years ago
- 零样本学习测评基准,中文版☆56Updated 3 years ago
- ☆53Updated 2 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated last year
- LORA微调BLOOMZ,参考BELLE☆25Updated 2 years ago
- CCL 2022 汉语学习者文本纠错评测☆138Updated 2 years ago
- NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model☆75Updated 2 years ago
- Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark☆129Updated last year
- 中文版unilm预训练模型☆83Updated 4 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆112Updated 3 months ago
- 对话改写介绍文章☆95Updated last year
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆40Updated 3 years ago
- ☆76Updated last year
- ☆127Updated 2 years ago
- 基于模板的文本纠错;Automatically Mining Error Templates for Grammatical Error Correction☆39Updated 2 years ago
- moss chat finetuning☆50Updated 11 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated 11 months ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆115Updated 2 months ago
- 时间抽取、解析、标准化工具☆51Updated 2 years ago
- Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension☆166Updated 2 years ago
- 中文图书语料MD5链接☆217Updated last year
- The code for our ACL2022 findings paper: CRACSpell: A Contextual Typo Robust Approach with Copy Mechanism to Improve Chinese Spelling Cor…☆75Updated 2 years ago
- 中文机器阅读理解数据集☆102Updated 4 years ago
- 文本去重☆69Updated 10 months ago
- NTK scaled version of ALiBi position encoding in Transformer.☆67Updated last year
- 历届中文句法错误诊断技术评测数据集☆38Updated 2 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Updated 3 years ago
- 中文 Instruction tuning datasets☆129Updated 11 months ago