TsinghuaAI / CPM-1-Pretrain
Pretrain CPM-1
☆53 Updated 4 years ago
Alternatives and similar repositories for CPM-1-Pretrain
Users who are interested in CPM-1-Pretrain are comparing it to the libraries listed below.
- Introduction to CPM ☆166 Updated 3 years ago
- ☆53 Updated 3 years ago
- Finetune CPM-1 ☆75 Updated 2 years ago
- NTK scaled version of ALiBi position encoding in Transformer. ☆69 Updated 2 years ago
- Finetune CPM-2 ☆83 Updated 2 years ago
- FLASHQuad_pytorch ☆68 Updated 3 years ago
- Tracking the progress in NLG for task-oriented dialogue system (resources, code, and new frontiers etc.) ☆134 Updated 3 years ago
- Upgraded version of RoFormer ☆154 Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆69 Updated 2 years ago
- OCNLI: Original Chinese Natural Language Inference task ☆157 Updated 3 years ago
- Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting ☆122 Updated 2 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model ☆75 Updated 2 years ago
- ☆219 Updated 2 years ago
- Simple experiments with the P-tuning method on Chinese ☆140 Updated 4 years ago
- ☆59 Updated 2 years ago
- A Dataset for Multi-Turn Dialogue Reasoning ☆326 Updated 4 years ago
- Code for CPM-2 Pre-Train ☆158 Updated 2 years ago
- ☆71 Updated 3 years ago
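- Challenge 3: example code and baseline implementation for the large-scale pre-training fine-tuning competition ☆37 Updated 2 years ago
- Simple experiments with the R-Drop method on Chinese tasks ☆91 Updated 3 years ago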
- EVA: Large-scale Pre-trained Chit-Chat Models ☆307 Updated 2 years ago
- NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model ☆75 Updated 2 years ago
- Sharing some problems encountered when applying S2S in practice, along with their solutions. ☆28 Updated 5 years ago
- ☆84 Updated last year
- Apply the Circular to the Pretraining Model ☆38 Updated 3 years ago
- ☆168 Updated 3 years ago
- A paper list of pre-trained language models (PLMs). ☆81 Updated 3 years ago
- CTC2021: SOTA solution and online demo for the Chinese Text Correction competition ☆73 Updated 2 years ago
- Must-read papers on improving efficiency for pre-trained language models. ☆105 Updated 2 years ago
- Open source code for EMNLP 2020 Findings Paper "AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent Detection and Slo… ☆86 Updated 3 years ago