Tsai-chasel / training-methods-tutorialLinks
☆12Updated 2 years ago
Alternatives and similar repositories for training-methods-tutorial
Users that are interested in training-methods-tutorial are comparing it to the libraries listed below
Sorting:
- DeepSpeed Tutorial☆106Updated last year
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆129Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group☆172Updated 3 months ago
- [ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text☆412Updated 9 months ago
- 多模态 MM +Chat 合集☆282Updated 5 months ago
- Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv…☆597Updated 10 months ago
- Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models☆910Updated last week
- ☆22Updated 10 months ago
- ☆716Updated this week
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆186Updated 2 years ago
- ☆84Updated 9 months ago
- Official implementation of ICLR 2026 paper "Urban Socio-Semantic Segmentation with Vision-Language Reasoning"☆155Updated last week
- ☆44Updated last year
- [NeurIPS2025 Spotlight 🔥 ] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Langu…☆267Updated 3 months ago
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model☆86Updated 6 months ago
- ☆36Updated last year
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆266Updated last year
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆928Updated 2 months ago
- Collect the awesome works evolved around reasoning models like O1/R1 in visual domain☆53Updated 6 months ago
- ☆1,112Updated 2 months ago
- ☆27Updated last month
- [CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆54Updated 3 months ago
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"☆268Updated 3 weeks ago
- [MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models☆290Updated 6 months ago
- The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".☆252Updated 2 years ago
- ☆41Updated last year
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆109Updated last year
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆47Updated 6 months ago
- [COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources☆306Updated 5 months ago
- Collection of image and video datasets for generative AI and multimodal visual AI☆33Updated last year