FudanNLP / NLPCC-WordSeg-WeiboLinks
☆14Updated 3 years ago
Alternatives and similar repositories for NLPCC-WordSeg-Weibo
Users that are interested in NLPCC-WordSeg-Weibo are comparing it to the libraries listed below
Sorting:
- Yet Another Chinese Learner Corpus☆77Updated 3 years ago
- 中文机器阅读理解数据集☆103Updated 4 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆24Updated 6 years ago
- This is the official repo for paper "CSDS: A Fine-grained Chinese Dataset for Customer Service Dialogue Summarization", accepted by EMNLP…☆96Updated 2 years ago
- This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"☆295Updated 5 years ago
- CCL 2022 汉语学习者文本纠错评测☆141Updated 2 years ago
- OCNLI: 中文原版自然语言推理任务☆157Updated 3 years ago
- 收集了目前为止中文领域的MRC抽取式数据集☆119Updated last year
- 中文机器阅读理解数据集☆63Updated 5 years ago
- This is code for paper: MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction☆66Updated 2 years ago
- chinese version of longformer☆113Updated 4 years ago
- The code for our ACL2022 findings paper: CRACSpell: A Contextual Typo Robust Approach with Copy Mechanism to Improve Chinese Spelling Cor…☆75Updated 3 years ago
- ☆268Updated 11 months ago
- Datasets and codes for the paper "RiSAWOZ: A Large-Scale Multi-Domain Wizard-of-Oz Dataset with Rich Semantic Annotations for Task-Orient…☆65Updated 2 years ago
- CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)☆247Updated 2 years ago
- P-tuning方法在中文上的简单实验☆139Updated 4 years ago
- ☆167Updated 3 years ago
- A Chinese Long Text Summarization Dataset☆71Updated 2 years ago
- Pytorch version of BERT-whitening☆306Updated 3 years ago
- SIGHAN中文纠错数据集及转换后格式☆64Updated 5 years ago
- Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021☆237Updated 2 years ago
- Chinese AMR Corpus☆38Updated 3 months ago
- A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.☆316Updated last year
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆83Updated last year
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Updated 3 years ago
- ☆128Updated 2 years ago
- The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"☆228Updated 2 years ago
- code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers"☆71Updated 10 months ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆116Updated last month
- ☆68Updated 5 years ago