Hansen06 / GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
☆37 Updated 3 years ago
Alternatives and similar repositories for GPT2-Chinese
Users interested in GPT2-Chinese are comparing it to the libraries listed below.
- TextGen: Implementation of text generation models, including LLaMA, ChatGLM, BLOOM, GPT2, BART, T5, SongNet, and so on. ☆982 Updated last year
- A manually curated Chinese dialogue dataset and fine-tuning code for ChatGLM ☆1,195 Updated 8 months ago
- PromptCLUE: a zero-shot learning model supporting a full range of Chinese tasks ☆665 Updated 2 years ago
- 骆驼 (Luotuo): a Chinese instruction-finetuned LLaMA. Developed by 陈启源 @ Central China Normal University & 李鲁鲁 @ SenseTime & 冷子昂 @ SenseTime ☆718 Updated 2 years ago
- pCLUE: a multi-task prompt-learning dataset with 1,000,000+ examples ☆505 Updated 3 years ago
- Efficient 4-bit QLoRA fine-tuning of ChatGLM-6B/ChatGLM2-6B with the peft library, including merging the LoRA model into the base model and 4-bit quantization ☆358 Updated 2 years ago
- Full-parameter fine-tuning of ChatGLM2-6B, with efficient fine-tuning for multi-turn dialogue ☆402 Updated 2 years ago
- ChatGLM-6B fine-tuning and Alpaca fine-tuning ☆1,539 Updated 10 months ago
- Multi-GPU ChatGLM with DeepSpeed and… ☆409 Updated last year
- 活字 (Huozi): a general-purpose large language model ☆391 Updated last year
- Exploring the fine-tuning performance of Chinese instruction data on ChatGLM and LLaMA ☆389 Updated 2 years ago
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream tuning ☆1,021 Updated last year
- A curated collection of open-source SFT datasets, continuously updated ☆569 Updated 2 years ago
- ChatGLM-6B instruction learning | instruction data | Instruct ☆655 Updated 2 years ago
- Easy and efficient fine-tuning of LLMs (supports LLaMA, LLaMA2, LLaMA3, Qwen, Baichuan, GLM, Falcon); efficient quantized training and deployment of large models ☆618 Updated last year
- Code for fine-tuning ChatGLM-6B using low-rank adaptation (LoRA) ☆719 Updated 2 years ago
- A Chinese medical consultation model based on ChatGLM-6B ☆828 Updated 2 years ago
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models ☆1,933 Updated 2 years ago
- [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset ☆661 Updated 2 years ago
- Easy-to-use CPM for Chinese text generation ☆531 Updated 2 years ago
- An Alpaca-style Chinese instruction fine-tuning dataset ☆397 Updated 2 years ago
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo ☆1,093 Updated last year
- ☆537 Updated last year
- 夫子•明察 (Fuzi-Mingcha): a Chinese judicial large language model jointly developed by Shandong University, Inspur Cloud, and China University of Political Science and Law. Built on the ChatGLM base model and trained on massive unsupervised Chinese legal corpora and supervised judicial fine-tuning data, it supports statute retrieval, case analysis, syllogistic judgment reasoning, and judicial dialogue, aiming to provide users with comprehensive, highly accurate legal consultation and answers… ☆366 Updated 6 months ago
- Directly apply RLHF to ChatGLM to raise or lower the probability of target outputs | Modify ChatGLM output with only RLHF ☆197 Updated 2 years ago
- Unified embedding model ☆878 Updated 2 years ago
- 桃李 (Taoli): a large language model for international Chinese education ☆189 Updated 2 years ago
- A PyTorch implementation of the PaddleNLP UIE model ☆682 Updated 2 years ago
- A Chinese medical ChatGPT based on LLaMA, trained on a large-scale pretraining corpus and a multi-turn dialogue dataset ☆385 Updated 2 years ago
- ⭐️ NLP algorithms built on the transformers lib, supporting text classification, text generation, information extraction, text matching, RLHF, SF… ☆2,407 Updated 2 years ago