yongzhuo / gemma-sftLinks
Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)
☆33Updated last year
Alternatives and similar repositories for gemma-sft
Users that are interested in gemma-sft are comparing it to the libraries listed below
Sorting:
- 大语言模型训练和服务调研☆36Updated 2 years ago
- ☆170Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆69Updated last year
- Imitate OpenAI with Local Models☆89Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Updated last year
- Alpaca Chinese Dataset -- 中文指令微调数据集☆217Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated 2 years ago
- 基于ChatGLM2-6B进行微调,包括全参数、参数有效性、量化感知训练等,可实现指令微调、多轮对话微调等。☆26Updated 2 years ago
- 大语言模型微调的项目,包含了使用QLora微调ChatGLM和LLama☆28Updated 2 years ago
- Its an open source LLM based on MOE Structure.☆58Updated last year
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55Updated 2 years ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆141Updated last year
- qwen models finetuning☆105Updated 9 months ago
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆91Updated 2 years ago
- 一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测,低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。☆221Updated 2 years ago
- 千问14B和7B的逐行解释☆63Updated 2 years ago
- share data, prompt data , pretraining data☆36Updated 2 years ago
- 多轮共情对话模型PICA☆97Updated 2 years ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual fin…☆36Updated 5 months ago
- 在中文开源大模型的基础上进行定制化的微调,拥有自己专属的语言模型。☆51Updated 2 years ago
- 用于微调LLM的中文指令数据集☆28Updated 2 years ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆47Updated 2 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- SuperCLUE琅琊榜:中文通用大模型匿名对战评价基准☆145Updated last year
- LLaMA Factory Document☆159Updated last week
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆89Updated 2 years ago
- Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on☆99Updated last year
- Baichuan2代码的逐行解析版本,适合小白☆213Updated 2 years ago
- 本项目致力于为大模型领域的初学者提供全面的知识体系,包括基础和高阶内容,以便开发者能迅速掌握大模型技术栈并全面了解相关知识。☆62Updated 11 months ago