yongzhuo / gemma-sftLinks

Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)

☆31

Alternatives and similar repositories for gemma-sft

Users that are interested in gemma-sft are comparing it to the libraries listed below

Sorting:

shuyhere / all-about-llm
大语言模型训练和服务调研
☆37Updated last year
yongzhuo / qwen2-sft
Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理
☆63Updated last year
yeyupiaoling / Chinese-LLM-Chat
大语言模型微调的项目，包含了使用QLora微调ChatGLM和LLama
☆27Updated 2 years ago
zejunwang1 / chatglm_tuning
基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调
☆55Updated 2 years ago
wjn1996 / ChatGLM2-Tuning
基于ChatGLM2-6B进行微调，包括全参数、参数有效性、量化感知训练等，可实现指令微调、多轮对话微调等。
☆25Updated last year
mobvoi / seq-monkey-data
☆146Updated last year
wellinxu / LLM_Custome
在中文开源大模型的基础上进行定制化的微调，拥有自己专属的语言模型。
☆47Updated 2 years ago
airaria / GRAIN
GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models
☆19Updated last year
yongzhuo / ChatGLM2-SFT
ChatGLM2-6B微调, SFT/LoRA, instruction finetune
☆108Updated last year
zysNLP / quickllm
A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …
☆46Updated last week
basicv8vc / chinese-instruction-datasets-for-llms
用于微调LLM的中文指令数据集
☆26Updated 2 years ago
taishan1994 / qlora-chinese-LLM
使用qlora对中文大语言模型进行微调，包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE
☆87Updated 2 years ago
lansinuote / Simple_LLM_DPO
☆70Updated last year
ChaosWang666 / Ziya-LLaMA-13B-deployment
Ziya-LLaMA-13B是IDEA基于LLaMa的130亿参数的大规模预训练模型，具备翻译，编程，文本分类，信息抽取，摘要，文案生成，常识问答和数学计算等能力。目前姜子牙通用大模型已完成大规模预训练、多任务有监督微调和人类反馈学习三阶段的训练过程。本文主要用于Ziya-…
☆45Updated 2 years ago
MetaGLM / OpenLM
本项目致力于为大模型领域的初学者提供全面的知识体系，包括基础和高阶内容，以便开发者能迅速掌握大模型技术栈并全面了解相关知识。
☆61Updated 5 months ago
Academic-Hammer / HammerLLM
1.4B sLLM for Chinese and English - HammerLLM🔨
☆44Updated last year
ssbuild / llm_finetuning
Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on
☆97Updated last year
NEU-DataMining / PICA
多轮共情对话模型PICA
☆95Updated last year
llm-factory / imitater
Imitate OpenAI with Local Models
☆86Updated 10 months ago
MikeGu721 / EasyLLM
make LLM easier to use
☆59Updated last year
cwxndl / LLM
大语言模型应用：RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛
☆62Updated 4 months ago
xverse-ai / XVERSE-65B
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
☆139Updated last year
open-chinese / alpaca-chinese-dataset
Alpaca Chinese Dataset -- 中文指令微调数据集
☆208Updated 8 months ago
Pillars-Creation / ChatGLM-RLHF-LoRA-RM-PPO
ChatGLM-6B添加了RLHF的实现，以及部分核心代码的逐行讲解 ,实例部分是做了个新闻短标题的生成，以及指定context推荐的RLHF的实现
☆86Updated last year
thu-coai / CritiqueLLM
☆142Updated 11 months ago
KongLongGeFDU / TransferTOD
The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"
☆20Updated 6 months ago
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆66Updated 2 years ago
jiahe7ay / infini-mini-transformer
This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…
☆57Updated last year
ArtificialZeng / llama3_explained
the newest version of llama3，source code explained line by line using Chinese
☆22Updated last year
MikeGu721 / AgentGroup
☆91Updated last year