wxc971231 / ddp-tutorial-seriesLinks
Follow the pytorch tutorial tutorial to learn how to use nn.parallel.DistributedDataParallel to speed up training
☆10Updated 9 months ago
Alternatives and similar repositories for ddp-tutorial-series
Users that are interested in ddp-tutorial-series are comparing it to the libraries listed below
Sorting:
- ☆391Updated 8 months ago
- 关于Transformer模型的最简洁pytorch实现,包含详细注释☆216Updated last year
- Huggingface transformers的中文文档☆273Updated last year
- 基于InternLM2大模型的离线具身智能导盲犬☆105Updated last year
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆125Updated 11 months ago
- ☆107Updated last year
- Qwen3 Fine-tuning: Medical R1 Style Chat☆202Updated 4 months ago
- ☆119Updated last year
- 通义千问VLLM推理部署DEMO☆611Updated last year
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆881Updated 3 weeks ago
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆393Updated last month
- Train a 1B LLM with 1T tokens from scratch by personal☆740Updated 5 months ago
- Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型,经过 这些年的大量的工业使用和论文验证,在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例,来解释Tran…☆281Updated last year
- MoE model with onnx runtime☆55Updated last year
- 简单易理解的代码,用于在qwen上使用grpo加强数学能力☆38Updated 5 months ago
- ☆1,080Updated last month
- personal chatgpt☆386Updated 10 months ago
- Inference code for LLaMA models☆125Updated 2 years ago
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆474Updated 5 months ago
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆599Updated 2 years ago
- PyTorch自动驾驶视觉感知算法实战示例代码☆74Updated last year
- How to use wandb?☆680Updated 2 years ago
- 《ChatGPT原理与实战:大型语 言模型的算法、技术和私有化》☆368Updated last year
- MindSpore online courses: Step into LLM☆476Updated last month
- 利用HuggingFace的官方下载工具从镜像网站进行高速下载。☆1,220Updated last year
- 多模态中文LLaMA&Alpaca大语言模型(VisualCLA)☆454Updated 2 years ago
- 一些 LLM 方面的从零复现笔记☆224Updated 5 months ago
- ☆15Updated last year
- DeepSpeed Tutorial☆102Updated last year
- 从0开始,将chatgpt的技术路线跑一遍。☆264Updated last year