thu-coai / KdConv

KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation

☆495

Alternatives and similar repositories for KdConv

Users who are interested in KdConv are comparing it to the repositories listed below

thu-coai / CrossWOZ
A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
☆708 Updated last year
fastnlp / CPT
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
☆493 Updated 2 years ago
ZhuiyiTechnology / roformer-sim
An upgraded version of SimBERT (SimBERTv2)!
☆445 Updated 3 years ago
lemon234071 / clean-dialog
A framework for cleaning Chinese dialog data
☆274 Updated 4 years ago
pluto-junzeng / CNSD
Chinese natural language inference dataset (A large-scale Chinese Natural Language Inference and Semantic Similarity Calculation Dataset)
☆434 Updated 5 years ago
ymcui / cmrc2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
☆444 Updated 3 years ago
renmada / t5-pegasus-pytorch
☆420 Updated last year
thu-coai / EVA
EVA: Large-scale Pre-trained Chit-Chat Models
☆307 Updated 2 years ago
ZhuiyiTechnology / t5-pegasus
Chinese generative pre-trained model
☆569 Updated 3 years ago
tongchangD / text_data_enhancement_with_LaserTagger
Modifies Chinese text, based on the LaserTagger model. Text paraphrasing: Chinese text data augmentation built on LaserTagger.
☆323 Updated last year
YunwenTechnology / Unilm
☆442 Updated 3 years ago
destwang / CTCResources
☆271 Updated last year
wdimmy / Automatic-Corpus-Generation
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
☆294 Updated 6 years ago
ymcui / MacBERT
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
☆695 Updated 4 months ago
cooelf / DeepUtteranceAggregation
Modeling Multi-turn Conversation with Deep Utterance Aggregation (COLING 2018)
☆287 Updated 5 years ago
zhusleep / pytorch_chinese_lm_pretrain
Chinese language model pre-training in PyTorch
☆387 Updated 5 years ago
zejunwang1 / CSTS
Chinese natural language inference and semantic similarity datasets
☆365 Updated 3 years ago
luhua-rain / MRC_Competition_Dureader
Champion and runner-up code for machine reading comprehension competitions, plus Chinese pre-trained MRC models
☆745 Updated 3 years ago
ZhuiyiTechnology / WoBERT
Chinese BERT that uses words as its basic unit
☆472 Updated 4 years ago
SunnyGJing / t5-pegasus-chinese
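Summarization and coreference resolution based on the Google T5 Chinese generative model, with support for batch generation and multiprocessing
☆227 Updated 2 years ago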
iflytek / HFL-Anthology
Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL)
☆377 Updated 2 years ago
CLUEbenchmark / FewCLUE
FewCLUE: a few-shot learning evaluation benchmark, Chinese edition
☆517 Updated 3 years ago
ZhuiyiTechnology / simbert
A BERT for retrieval and generation
☆861 Updated 4 years ago
luge-ai / luge-ai
☆441 Updated 7 months ago
ShannonAI / ChineseBert
Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"
☆563 Updated 2 years ago
bojone / SPACES
End-to-end long-text summarization model (CAIL 2020 judicial summarization track)
☆399 Updated last year
liushulinle / PLOME
Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021
☆239 Updated 3 years ago
CLUEbenchmark / CLUEPretrainedModels
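A collection of high-quality Chinese pre-trained models: state-of-the-art large models, the fastest small models, and models specialized for similarity
☆817 Updated 5 years ago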
CLUEbenchmark / SimCLUE
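A dataset of 3,000,000+ examples for semantic understanding and matching. It can be used with unsupervised contrastive learning, semi-supervised learning, and similar methods to build the best-performing pre-trained models for Chinese.
☆311 Updated 3 years ago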
ACL2020SpellGCN / SpellGCN
SpellGCN
☆252 Updated 4 years ago