Hansen06 / GPT2-ChineseLinks
Chinese version of GPT2 training code, using BERT tokenizer.
☆37Updated 3 years ago
Alternatives and similar repositories for GPT2-Chinese
Users that are interested in GPT2-Chinese are comparing it to the libraries listed below
Sorting:
- 骆驼:A Chinese finetuned instruction LLaMA. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技☆722Updated 2 years ago
- TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLO…☆968Updated 11 months ago
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆360Updated last year
- PromptCLUE, 全中文任务支持零样本学习模型☆666Updated 2 years ago
- pCLUE: 1000000+多任务提示学习数据集☆500Updated 2 years ago
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,894Updated 2 years ago
- 人工精调的中文对话数据集和一段chatglm的微调代码☆1,187Updated 3 months ago
- ChatGLM-6B 指令学习|指令数据|Instruct☆654Updated 2 years ago
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo☆1,079Updated last year
- 探索中文instruct数据在ChatGLM, LLaMA上的微调表现☆391Updated 2 years ago
- chatglm多gpu用deepspeed和☆409Updated last year
- ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。☆400Updated last year
- dialogbot, provide search-based dialogue, task-based dialogue and generative dialogue model. 对话机器人,基于问答型对话、任务型对话、聊天型对话等模型实现,支持网络检索问答,领域知识…☆335Updated last year
- chatglm 6b finetuning and alpaca finetuning☆1,544Updated 5 months ago
- 开源SFT数据集整理,随时补充☆533Updated 2 years ago
- Code for fintune ChatGLM-6b using low-rank adaptation (LoRA)☆720Updated 2 years ago
- A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.☆371Updated last year
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆611Updated 6 months ago
- [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集☆637Updated 2 years ago
- ☆523Updated last year
- 基于ChatGLM-6B的中文问诊模型☆821Updated last year
- unified embedding model☆866Updated last year
- 活字通用大模型☆393Updated 11 months ago
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,010Updated last year
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆412Updated last year
- Easy-to-use CPM for Chinese text generation(基于CPM的中文文本生成)☆535Updated 2 years ago
- 对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF☆196Updated 2 years ago
- Implementation of Chinese ChatGPT☆287Updated last year
- 夫子•明察司法大模型是由山东大学、浪潮云、中国政法大学联合研发,以 ChatGLM 为大模型底座,基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能,旨在为用户提供全方位、高精准的法律咨询与解答…☆350Updated 2 weeks ago
- Mengzi Pretrained Models☆537Updated 2 years ago