TsinghuaAI/CPM-2-Pretrain

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TsinghuaAI/CPM-2-Pretrain)

TsinghuaAI / CPM-2-Pretrain

Code for CPM-2 Pre-Train

☆157

Alternatives and similar repositories for CPM-2-Pretrain

Users that are interested in CPM-2-Pretrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TsinghuaAI / CPM-2-Finetune
View on GitHub
Finetune CPM-2
☆80Mar 18, 2023Updated 3 years ago
TsinghuaAI / CPM
View on GitHub
Introduction to CPM
☆164Sep 26, 2021Updated 4 years ago
TsinghuaAI / TDS
View on GitHub
A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline
☆25Apr 16, 2021Updated 5 years ago
TsinghuaAI / CPM-1-Finetune
View on GitHub
Finetune CPM-1
☆73Mar 18, 2023Updated 3 years ago
TsinghuaAI / CPM-1-Generate
View on GitHub
Chinese Pre-Trained Language Models (CPM-LM) Version-I
☆1,579Mar 18, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OpenBMB / BMInf
View on GitHub
Efficient Inference for Big Models
☆583Jul 7, 2026Updated 2 weeks ago
BAAI-WuDao / P-tuning
View on GitHub
Finetune CPM-1
☆24Jun 20, 2021Updated 5 years ago
Harry-Chen / InfMoE
View on GitHub
Inference framework for MoE layers based on TensorRT with Python binding
☆40May 31, 2021Updated 5 years ago
yangjianxin1 / CPM
View on GitHub
Easy-to-use CPM for Chinese text generation（基于CPM的中文文本生成）
☆530Apr 10, 2023Updated 3 years ago
TsinghuaAI / CUGE
View on GitHub
☆54Apr 15, 2022Updated 4 years ago
BAAI-WuDao / EVA
View on GitHub
☆25Sep 29, 2021Updated 4 years ago
jm12138 / CPM-Generate-Pytorch
View on GitHub
☆36Jan 5, 2021Updated 5 years ago
laekov / fastmoe
View on GitHub
A fast MoE impl for PyTorch
☆1,857Feb 10, 2025Updated last year
yxuansu / HCL
View on GitHub
[ACL'21] Dialogue Response Selection with Hierarchical Curriculum Learning
☆21Nov 15, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Yifan-Gao / open_retrieval_conversational_machine_reading
View on GitHub
Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset
☆13Nov 19, 2022Updated 3 years ago
thu-coai / EVA
View on GitHub
EVA: Large-scale Pre-trained Chit-Chat Models
☆304Mar 11, 2023Updated 3 years ago
TsinghuaAI / CPM-KG
View on GitHub
☆49Dec 24, 2020Updated 5 years ago
BAAI-WuDao / Chinese-Transformer-XL
View on GitHub
☆34Jul 29, 2021Updated 4 years ago
thu-coai / CDial-GPT
View on GitHub
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
☆1,957Jun 12, 2023Updated 3 years ago
MichaelZhouwang / VLUE
View on GitHub
This repo contains codes and instructions for baselines in the VLUE benchmark.
☆41Jul 16, 2022Updated 4 years ago
bojone / CPM_LM_bert4keras
View on GitHub
在bert4keras下加载CPM_LM模型
☆51Nov 24, 2020Updated 5 years ago
TsinghuaAI / CPM-1-Pretrain
View on GitHub
Pretrain CPM-1
☆53Apr 20, 2021Updated 5 years ago
Langboat / Mengzi
View on GitHub
Mengzi Pretrained Models
☆544Nov 29, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
yzheng51 / 2021-GAIIC-Task3-Preliminary-Share
View on GitHub
全球人工智能技术创新大赛-赛道三：小布助手对话短文本语义匹配
☆11Apr 5, 2021Updated 5 years ago
abhinavkashyap / dct
View on GitHub
Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"
☆16Jan 19, 2024Updated 2 years ago
TsinghuaAI / CPM-1-Distill
View on GitHub
Distill CPM-1
☆18May 6, 2021Updated 5 years ago
FudanNLP / CPT
View on GitHub
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
☆494Dec 30, 2022Updated 3 years ago
deepdialog / CPM-LM-TF2
View on GitHub
☆246Oct 21, 2022Updated 3 years ago
dbiir / UER-py
View on GitHub
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,110May 9, 2024Updated 2 years ago
thunlp / Knowledge-Inheritance
View on GitHub
Source code for paper: Knowledge Inheritance for Pre-trained Language Models
☆37Apr 24, 2022Updated 4 years ago
CLUEbenchmark / CLUECorpus2020
View on GitHub
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
☆1,017Feb 6, 2026Updated 5 months ago
alibaba / AliceMind
View on GitHub
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
☆2,042Mar 19, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ZhuiyiTechnology / WoBERT
View on GitHub
以词为基本单位的中文BERT
☆475Nov 18, 2021Updated 4 years ago
airaria / TextBrewer
View on GitHub
A PyTorch-based knowledge distillation toolkit for natural language processing
☆1,705May 8, 2023Updated 3 years ago
luciusssss / why-learn-shortcut
View on GitHub
[ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?
☆16Aug 8, 2023Updated 2 years ago
ymcui / Chinese-ELECTRA
View on GitHub
Pre-trained Chinese ELECTRA（中文ELECTRA预训练模型）
☆1,433Apr 19, 2026Updated 3 months ago
yxuansu / PlanGen
View on GitHub
[EMNLP'21] Plan-then-Generate: Controlled Data-to-Text Generation via Planning
☆76Jun 15, 2022Updated 4 years ago
mt-upc / joint
View on GitHub
Joint Source-Target Self Attention with Locality Constraints
☆20May 9, 2020Updated 6 years ago
BAAI-WuDao / CPM
View on GitHub
Introduction to CPM
☆17Jun 22, 2021Updated 5 years ago