bobo0810 / LearnDeepSpeedLinks

DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）

☆183

Alternatives and similar repositories for LearnDeepSpeed

Users that are interested in LearnDeepSpeed are comparing it to the libraries listed below

Sorting:

wdndev / mllm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师多模态相关知识
☆252Updated last year
AI-Study-Han / Zero-Qwen-VL
训练一个对中文支持更好的LLaVA模型，并开源训练代码和数据。
☆77Updated last year
xxcheng0708 / pytorch-model-train-template
pytorch单精度、半精度、混合精度、单卡、多卡（DP / DDP）、FSDP、DeepSpeed模型训练代码，并对比不同方法的训练速度以及GPU内存的使用
☆127Updated last year
qingkelab / qingketalk
青稞Talk
☆169Updated 2 weeks ago
step-law / steplaw
☆207Updated last month
Victorwz / Open-Qwen2VL
[COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources
☆287Updated 3 months ago
Coobiw / MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv…
☆521Updated 8 months ago
GAIR-NLP / MAYE
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
☆145Updated 7 months ago
OpenRLHF / OpenRLHF-M
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.
☆149Updated 2 months ago
swordlidev / Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey
☆376Updated 7 months ago
yfzhang114 / Awesome-Multimodal-Large-Language-Models
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
☆791Updated this week
Glanvery / LLM-Travel
欢迎来到 "LLM-travel" 仓库！探索大语言模型（LLM）的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。
☆354Updated last year
liujunwen23 / MIRE
WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge
☆129Updated last year
yuanzhoulvpi2017 / vscode_debug_transformers
☆400Updated 9 months ago
chunhuizhang / llm_rl
llm & rl
☆256Updated last month
OvJat / DeepSpeedTutorial
DeepSpeed Tutorial
☆104Updated last year
datawhalechina / sora-tutorial
☆103Updated last year
chunhuizhang / pytorch_distribute_tutorials
pytorch distribute tutorials
☆160Updated 5 months ago
yongliang-wu / DFT
[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.
☆503Updated last month
liangyuwang / zo2
ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]
☆196Updated 4 months ago
Ablustrund / LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
☆386Updated last year
hhaAndroid / awesome-mm-chat
多模态 MM +Chat 合集
☆279Updated 3 months ago
HarderThenHarder / RLLoggingBoard
A visuailzation tool to make deep understaning and easier debugging for RLHF training.
☆271Updated 9 months ago
hengjiUSTC / learn-llm
☆115Updated last year
a-m-team / a-m-models
a-m-team's exploration in large language modeling
☆194Updated 6 months ago
RethinkFun / trian_ppo
☆127Updated last year
RLHF-V / RLAIF-V
[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
☆425Updated 6 months ago
taishan1994 / llava-handbook
对llava官方代码的一些学习笔记
☆28Updated last year
Outsider565 / LoRA-GA
☆215Updated last week
sunkx109 / llama
Inference code for LLaMA models
☆128Updated 2 years ago