dourgey / qwen2_moe_mergekit
根据Qwen2(Qwen1.5)模型生成qwen2 MoE模型的工具
☆13Updated 10 months ago
Alternatives and similar repositories for qwen2_moe_mergekit:
Users that are interested in qwen2_moe_mergekit are comparing it to the libraries listed below
- 怎么训练一个LLM分词器☆140Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆64Updated last year
- ☆104Updated 3 months ago
- 使用单个24G显卡,从0开始训练LLM☆50Updated 3 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆74Updated 3 months ago
- qwen-7b and qwen-14b finetuning☆90Updated 10 months ago
- ☆102Updated 7 months ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆42Updated this week
- deepspeed+trainer简单高效实现多卡微调大模型☆122Updated last year
- 对llama3进行全参微调、lora微调以及qlora微调。☆176Updated 4 months ago
- 大语言模型指令调优工具(支持 FlashAttention)☆169Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆105Updated last year
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆49Updated last month
- pytorch分布式训练☆63Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated last year
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆26Updated 7 months ago
- deep learning☆150Updated 8 months ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆56Updated last week
- ☆84Updated last year
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated 9 months ago
- 欢迎 来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆298Updated 7 months ago
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆116Updated last year
- 基于DPO算法微调语言大模型,简单好上手。☆30Updated 7 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆53Updated 9 months ago
- 阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理☆80Updated 9 months ago
- LLM+RAG for QA☆21Updated last year
- baichuan LLM surpervised finetune by lora☆62Updated last year
- ☆15Updated 10 months ago
- code for piccolo embedding model from SenseTime☆119Updated 9 months ago
- 通用简单工具项目☆15Updated 4 months ago