Code for CPM-2 Pre-Train
☆157Mar 18, 2023Updated 3 years ago
Alternatives and similar repositories for CPM-2-Pretrain
Users that are interested in CPM-2-Pretrain are comparing it to the libraries listed below.
- Finetune CPM-2☆81Mar 18, 2023Updated 3 years ago
- Introduction to CPM☆165Sep 26, 2021Updated 4 years ago
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline☆25Apr 16, 2021Updated 5 years ago
- Finetune CPM-1☆73Mar 18, 2023Updated 3 years ago
- Chinese Pre-Trained Language Models (CPM-LM) Version-I☆1,580Mar 18, 2023Updated 3 years ago
- Efficient Inference for Big Models☆586Jan 24, 2023Updated 3 years ago
- Finetune CPM-1☆24Jun 20, 2021Updated 4 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- Easy-to-use CPM for Chinese text generation☆531Apr 10, 2023Updated 3 years ago
- ☆54Apr 15, 2022Updated 4 years ago
- ☆25Sep 29, 2021Updated 4 years ago
- ☆37Jan 5, 2021Updated 5 years ago
- [ACL'21] Dialogue Response Selection with Hierarchical Curriculum Learning☆21Nov 15, 2022Updated 3 years ago
- A fast MoE impl for PyTorch☆1,847Feb 10, 2025Updated last year
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆305Mar 11, 2023Updated 3 years ago
- ☆49Dec 24, 2020Updated 5 years ago
- ☆34Jul 29, 2021Updated 4 years ago
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,941Jun 12, 2023Updated 2 years ago
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Jul 16, 2022Updated 3 years ago
- Load the CPM_LM model in bert4keras☆51Nov 24, 2020Updated 5 years ago
- Pretrain CPM-1☆53Apr 20, 2021Updated 4 years ago
- Mengzi Pretrained Models☆544Nov 29, 2022Updated 3 years ago
- Global AI Technology Innovation Competition, Track 3: Short-Text Semantic Matching for Xiaobu Assistant Dialogues☆11Apr 5, 2021Updated 5 years ago
- Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"☆16Jan 19, 2024Updated 2 years ago
- Distill CPM-1☆18May 6, 2021Updated 4 years ago
- ☆246Oct 21, 2022Updated 3 years ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation☆496Dec 30, 2022Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,104May 9, 2024Updated last year
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆37Apr 24, 2022Updated 3 years ago
- Large-scale Pre-training Corpus for Chinese (100 GB)☆1,001Feb 6, 2026Updated 2 months ago
- ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab☆2,048Mar 19, 2024Updated 2 years ago
- Chinese BERT with words as the basic unit☆477Nov 18, 2021Updated 4 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆16Apr 22, 2021Updated 4 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,696May 8, 2023Updated 2 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?☆16Aug 8, 2023Updated 2 years ago
- Pre-trained Chinese ELECTRA☆1,439Jul 15, 2025Updated 9 months ago
- [EMNLP'21] Plan-then-Generate: Controlled Data-to-Text Generation via Planning☆76Jun 15, 2022Updated 3 years ago
- Joint Source-Target Self Attention with Locality Constraints☆20May 9, 2020Updated 5 years ago