TsinghuaAI / CPM-1-Distill
Distill CPM-1
☆17Updated 4 years ago
Alternatives and similar repositories for CPM-1-Distill
Users who are interested in CPM-1-Distill are comparing it to the libraries listed below
- Finetune CPM-1☆74Updated 2 years ago
- ☆37Updated 4 years ago
- ☆34Updated 3 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆68Updated last year
- High-performance small model evaluation: Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆58Updated 4 years ago
- Zero-shot NLU & NLG based on the mengzi-t5-base-mt pretrained model☆74Updated 2 years ago
- Load CDial-GPT with bert4keras☆38Updated 4 years ago
- Code for CPM-2 Pre-Train☆158Updated 2 years ago
- Finetune CPM-2☆82Updated 2 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Updated last year
- Zero-shot learning evaluation benchmark, Chinese version☆56Updated 3 years ago
- ☆59Updated last year
- Chinese UniLM pretrained model☆83Updated 4 years ago
- CLUEWSC2020: Chinese version of the Winograd Schema Challenge (WSC), a Chinese coreference resolution task☆75Updated 4 years ago
- K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP …☆31Updated 2 years ago
- Some methods for unsupervised text generation☆48Updated 3 years ago
- Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension☆166Updated 3 years ago
- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆47Updated last year
- An official implementation of SARG: A Novel Semi Autoregressive Generator for Multi-turn Incomplete Utterance Restoration☆49Updated 3 years ago
- CTC2021: SOTA solution and online demo for the Chinese text correction competition☆72Updated last year
- Introduction to CPM☆165Updated 3 years ago
- A more efficient GLM implementation!☆55Updated 2 years ago
- True “Deep Learning for Humans”☆141Updated 3 years ago
- Pretrain CPM-1☆51Updated 4 years ago
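- Sharing some problems encountered when applying S2S in practice, and their solutions.☆27Updated 4 years ago
- This project collects publicly available Chinese (and English) training sets from current dialogue-system papers. Datasets for training Dialog.☆22Updated 5 years ago
- Introductory article on dialogue rewriting☆97Updated last year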
- ☆53Updated 3 years ago
- A simple experiment with the P-tuning method on Chinese☆139Updated 4 years ago