THUNLP-AIPoet / CCPMLinks
☆38Updated 3 years ago
Alternatives and similar repositories for CCPM
Users that are interested in CCPM are comparing it to the libraries listed below
Sorting:
- 本仓库是基于bert4keras实现的古文-现代文翻译模型。具体使用了基于掩码自注意力机制的UNILM(Li al., 2019)预训练模型作为翻译系统的backbone。我们首先使用了普通的中文(现代文)BERT、Roberta权重作为UNILM的初始权重以训练UNILM…☆50Updated 3 years ago
- Chinese AMR Corpus☆38Updated 3 months ago
- The first Chinese metaphor corpus serving for identification and generation. 中文比喻数据集. Presented at COLING 2022.☆42Updated 2 years ago
- Yet Another Chinese Learner Corpus☆77Updated 3 years ago
- 中文机器阅读理解数据集☆103Updated 4 years ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation☆489Updated 2 years ago
- 历届中文句法错误诊断技术评测数据集☆42Updated 3 years ago
- A Chinese legal case retrieval dataset.☆143Updated last year
- 非官方的MDCSpell论文的实现☆18Updated 2 years ago
- This is the repository of EMNLP'2022 paper: "Improving Multi-turn Emotional Support Dialogue Generation with Lookahead Strategy Planning"…☆43Updated 2 years ago
- 收集了目前为止中文领域的MRC抽取式数据集☆119Updated last year
- EVA: Large-scale Pre-trained Chit-Chat Models☆307Updated 2 years ago
- ☆48Updated last year
- ☆106Updated last year
- 基于GOOGLE T5中文生成式模型的摘要生成/指代消解,支持batch批量生成,多进程☆226Updated last year
- CCL 2022 汉语学习者文本纠错评测☆141Updated 2 years ago
- This is the official repo for paper "CSDS: A Fine-grained Chinese Dataset for Customer Service Dialogue Summarization", accepted by EMNLP…☆96Updated 2 years ago
- ☆63Updated 3 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆24Updated 6 years ago
- Source code for the paper "Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granular…☆41Updated 2 years ago
- ☆268Updated 11 months ago
- Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems☆270Updated last year
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆83Updated last year
- KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation☆484Updated 2 years ago
- A framework for cleaning Chinese dialog data☆272Updated 4 years ago
- CCL 2020 中文隐喻识别与情感分析任务说明与数据集☆41Updated 4 years ago
- A chinese simile recognition dataset of "Xiang".☆22Updated 2 years ago
- Source code and checkpoints for legal pre-trained language models.☆185Updated 4 years ago
- Implementation of AAAI2021 paper "Writing Polishment with Simile: Task, Dataset and A Neural Approach"☆21Updated 4 years ago
- This is the repository of the Ape210K dataset and baseline models.☆194Updated 5 years ago