xxcheng0708 / pytorch-model-train-templateLinks

pytorch单精度、半精度、混合精度、单卡、多卡（DP / DDP）、FSDP、DeepSpeed模型训练代码，并对比不同方法的训练速度以及GPU内存的使用

☆113

Alternatives and similar repositories for pytorch-model-train-template

Users that are interested in pytorch-model-train-template are comparing it to the libraries listed below

Sorting:

bobo0810 / LearnDeepSpeed
DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）
☆171Updated last year
godweiyang / GrabGPU
一款便捷的抢占显卡脚本
☆341Updated 6 months ago
OpenDocCN / python-code-anls
☆43Updated 6 months ago
wdndev / mllm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师多模态相关知识
☆219Updated last year
OvJat / DeepSpeedTutorial
DeepSpeed Tutorial
☆100Updated 11 months ago
yfzhang114 / Awesome-Multimodal-Large-Language-Models
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
☆528Updated 3 weeks ago
Victorwz / Open-Qwen2VL
[COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources
☆244Updated 2 months ago
hhaAndroid / awesome-mm-chat
多模态 MM +Chat 合集
☆273Updated 2 months ago
3017218062 / Pytorch-Lightning-Learning
Pytorch Lightning入门中文教程，转载请注明来源。（当初是写着玩的，建议看完MNIST这个例子再上手）
☆221Updated 4 years ago
serend1p1ty / core-pytorch-utils
Yet another PyTorch Trainer and some core components for deep learning.
☆221Updated last year
Tramac / paper-reading-note
和李沐一起读论文
☆207Updated last month
Outsider565 / LoRA-GA
☆204Updated 9 months ago
kyegomez / NaViT
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
☆249Updated 2 weeks ago
bubbliiiing / DiT-pytorch
这是一个DiT-pytorch的代码，主要用于学习DiT结构。
☆78Updated last year
LMM101 / Awesome-Multimodal-Next-Token-Prediction
[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
☆446Updated 6 months ago
BIGBALLON / distribuuuu
The pure and clear PyTorch Distributed Training Framework.
☆275Updated last year
justchenhao / ChatDailyPapers
Build a daily academic subscription pipeline! Get daily Arxiv papers and corresponding chatGPT summaries with pre-defined keywords. It is…
☆41Updated 2 years ago
rentainhe / pytorch-distributed-training
Simple tutorials on Pytorch DDP training
☆281Updated 2 years ago
Kwai-Keye / Keye
☆491Updated last week
Meituan-AutoML / VisionLLaMA
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
☆386Updated last year
HongxinXiang / pytorch-multi-GPU-training-tutorial
☆69Updated 2 years ago
swordlidev / Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey
☆362Updated 3 months ago
bojone / papers.cool
Cool Papers - Immersive Paper Discovery
☆584Updated 2 months ago
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆153Updated last month
x-cls / superclass
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
☆211Updated 4 months ago
KaihuaTang / LLM-TP-Inference-on-910B
本项目提供了基于910B的huggingface LLM模型的Tensor Parallel(TP)部署教程，同时也可以作为一份极简的TP学习代码。
☆27Updated 10 months ago
lichao-sun / SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision M…
☆497Updated last year
kyegomez / AttentionIsOFFByOne
Implementation of "Attention Is Off By One" by Evan Miller
☆193Updated last year
OpenRL-Lab / Wandb_Tutorial
How to use wandb?
☆668Updated last year
KaiiZhang / DDP-Tutorial
☆64Updated 3 years ago