TsinghuaAI/CPM-1-Finetune

Finetune CPM-1

☆73

Alternatives and similar repositories for CPM-1-Finetune

Users interested in CPM-1-Finetune are comparing it to the repositories listed below.

jm12138 / CPM-Generate-Pytorch
☆36 · Jan 5, 2021 · Updated 5 years ago
TsinghuaAI / CPM-1-Generate
Chinese Pre-Trained Language Models (CPM-LM) Version-I
☆1,579 · Mar 18, 2023 · Updated 3 years ago
BAAI-WuDao / EVA
☆25 · Sep 29, 2021 · Updated 4 years ago
TsinghuaAI / TDS
A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline
☆25 · Apr 16, 2021 · Updated 5 years ago
TsinghuaAI / CPM-2-Pretrain
Code for CPM-2 Pre-Train
☆157 · Mar 18, 2023 · Updated 3 years ago
TsinghuaAI / CPM-1-Distill
Distill CPM-1
☆18 · May 6, 2021 · Updated 5 years ago
TsinghuaAI / CPM-1-Pretrain
Pretrain CPM-1
☆53 · Apr 20, 2021 · Updated 5 years ago
BAAI-WuDao / Chinese-Transformer-XL
☆34 · Jul 29, 2021 · Updated 4 years ago
TsinghuaAI / CPM-2-Finetune
Finetune CPM-2
☆80 · Mar 18, 2023 · Updated 3 years ago
yxuansu / HCL
[ACL'21] Dialogue Response Selection with Hierarchical Curriculum Learning
☆21 · Nov 15, 2022 · Updated 3 years ago
deepdialog / CPM-LM-TF2
☆246 · Oct 21, 2022 · Updated 3 years ago
thu-coai / ConPer
Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation"
☆16 · Sep 1, 2022 · Updated 3 years ago
lemon234071 / clean-dialog
A framework for cleaning Chinese dialog data
☆274 · May 14, 2021 · Updated 5 years ago
facebookresearch / task_bench
The TaskBench500 dataset and code for generating tasks.
☆16 · Jul 16, 2022 · Updated 4 years ago
LinyangLee / Token-Aware-VAT
Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding.
☆25 · Dec 3, 2020 · Updated 5 years ago
yangjianxin1 / CPM
Easy-to-use CPM for Chinese text generation (Chinese text generation based on CPM)
☆530 · Apr 10, 2023 · Updated 3 years ago
shoarora / transformers-trainers
Tools for training pytorch language models
☆27 · Nov 14, 2020 · Updated 5 years ago
KuaiSearchPERKS / PERKS
KuaiSearch PERKS
☆12 · Nov 16, 2021 · Updated 4 years ago
thu-coai / CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
☆1,957 · Jun 12, 2023 · Updated 3 years ago
thu-coai / EVA
EVA: Large-scale Pre-trained Chit-Chat Models
☆304 · Mar 11, 2023 · Updated 3 years ago
danieldeutsch / summarize
☆12 · Nov 11, 2019 · Updated 6 years ago
iseesaw / SMP-MCC2020
Dataset and Baseline for SMP-MCC2020
☆23 · Jul 6, 2023 · Updated 3 years ago
BAAI-WuDao / P-tuning
Finetune CPM-1
☆24 · Jun 20, 2021 · Updated 5 years ago
thunlp / Knowledge-Inheritance
Source code for paper: Knowledge Inheritance for Pre-trained Language Models
☆37 · Apr 24, 2022 · Updated 4 years ago
nusnlp / paraphrasing-squad
Datasets for the paper "Improving the Robustness of Question Answering Systems to Question Paraphrasing" (ACL 2019)
☆27 · Aug 7, 2019 · Updated 6 years ago
princeton-nlp / LM-BFF
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
☆727 · Aug 29, 2022 · Updated 3 years ago
bcmi220 / seq2seq_parser
☆20 · Sep 23, 2018 · Updated 7 years ago
TsinghuaAI / CPM
Introduction to CPM
☆164 · Sep 26, 2021 · Updated 4 years ago
XiaoyuanYi / StyIns
The source code of Text Style Transfer via Learning Style Instance Supported Latent Space (IJCAI 2020).
☆38 · Dec 21, 2020 · Updated 5 years ago
CLAW-Lab / ToM
Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind"
☆18 · May 18, 2022 · Updated 4 years ago
chujiezheng / ChID-Dataset
ChID: A Large-scale Chinese IDiom Dataset for Cloze Test
☆150 · May 8, 2023 · Updated 3 years ago
vikas95 / AIR-retriever
AIR retriever for Multi-Hop QA (ACL 2020 paper)
☆30 · Jul 18, 2020 · Updated 6 years ago
luciusssss / why-learn-shortcut
[ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?
☆16 · Aug 8, 2023 · Updated 2 years ago
Yifan-Gao / multilingual_keyphrase_generation
[NAACL'22-Findings] Dataset for "Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training"
☆18 · Sep 21, 2022 · Updated 3 years ago
qiufengyuyi / lear_ner_extraction
using lear to do ner extraction
☆29 · Mar 13, 2022 · Updated 4 years ago
vasisouv / tweets-preprocessor
Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team
☆26 · Dec 10, 2020 · Updated 5 years ago
lizekang / DSTC10-MOD
DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog
☆51 · Feb 16, 2023 · Updated 3 years ago
ghosthamlet / gpt2-ml-torch
Pytorch model for https://github.com/imcaspar/gpt2-ml
☆78 · Nov 21, 2021 · Updated 4 years ago
yhlleo / frechet-bert-distance
Findings of ACL 2021
☆24 · May 8, 2021 · Updated 5 years ago