Code for CPM-2 Pre-Train
☆157Mar 18, 2023Updated 3 years ago
Alternatives and similar repositories for CPM-2-Pretrain
Users that are interested in CPM-2-Pretrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Finetune CPM-2☆82Mar 18, 2023Updated 3 years ago
- Introduction to CPM☆165Sep 26, 2021Updated 4 years ago
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline☆25Apr 16, 2021Updated 4 years ago
- Finetune CPM-1☆74Mar 18, 2023Updated 3 years ago
- Chinese Pre-Trained Language Models (CPM-LM) Version-I☆1,580Mar 18, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Efficient Inference for Big Models☆586Jan 24, 2023Updated 3 years ago
- Finetune CPM-1☆24Jun 20, 2021Updated 4 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- Easy-to-use CPM for Chinese text generation(基于CPM的中文文本生成)☆531Apr 10, 2023Updated 2 years ago
- ☆54Apr 15, 2022Updated 3 years ago
- ☆25Sep 29, 2021Updated 4 years ago
- ☆37Jan 5, 2021Updated 5 years ago
- [ACL'21] Dialogue Response Selection with Hierarchical Curriculum Learning☆21Nov 15, 2022Updated 3 years ago
- A fast MoE impl for PyTorch☆1,846Feb 10, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆305Mar 11, 2023Updated 3 years ago
- ☆49Dec 24, 2020Updated 5 years ago
- ☆34Jul 29, 2021Updated 4 years ago
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,938Jun 12, 2023Updated 2 years ago
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Jul 16, 2022Updated 3 years ago
- 在bert4keras下加载CPM_LM模型☆51Nov 24, 2020Updated 5 years ago
- Pretrain CPM-1☆53Apr 20, 2021Updated 4 years ago
- Mengzi Pretrained Models☆542Nov 29, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 全球人工智能技术创新大赛-赛道三:小布助手对话短文本语义匹配☆12Apr 5, 2021Updated 4 years ago
- Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"☆16Jan 19, 2024Updated 2 years ago
- Distill CPM-1☆18May 6, 2021Updated 4 years ago
- ☆247Oct 21, 2022Updated 3 years ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation☆496Dec 30, 2022Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,105May 9, 2024Updated last year
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Apr 24, 2022Updated 3 years ago
- Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料☆1,002Feb 6, 2026Updated last month
- ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab☆2,049Mar 19, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 以词为基本单位的中文BERT☆477Nov 18, 2021Updated 4 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,697May 8, 2023Updated 2 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?☆16Aug 8, 2023Updated 2 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,440Jul 15, 2025Updated 8 months ago
- [EMNLP'21] Plan-then-Generate: Controlled Data-to-Text Generation via Planning☆76Jun 15, 2022Updated 3 years ago
- Joint Source-Target Self Attention with Locality Constraints☆20May 9, 2020Updated 5 years ago
- Introduction to CPM☆17Jun 22, 2021Updated 4 years ago