owenliang / pytorch-transformer
A PyTorch reimplementation of the Transformer
☆79 Updated last year
Alternatives and similar repositories for pytorch-transformer
Users that are interested in pytorch-transformer are comparing it to the libraries listed below
- Vision Transformer on the MNIST dataset ☆35 Updated last year
- A PyTorch reimplementation of Stable Diffusion ☆175 Updated last year
- Diffusion Transformers (DiTs) trained on the MNIST dataset ☆116 Updated last year
- Qwen2.5 0.5B GRPO ☆51 Updated 4 months ago
- 童发发's large-model learning journey ☆87 Updated this week
- PyTorch distributed tutorials ☆138 Updated last week
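- Theory and practice of large model/LLM inference and deployment ☆278 Updated 3 months ago
- ☆337 Updated 4 months ago
- TinyRAG ☆307 Updated last week
- A project for training a large language model from scratch, covering pretraining, fine-tuning, and direct preference optimization; the model has 1B parameters and supports Chinese and English. ☆437 Updated 4 months ago
- Hand-coded interview questions (not LeetCode) for LLMs (the main focus) and other AI algorithms such as search, advertising, and recommendation, e.g. Self-Attention and AUC; these generally test a candidate's overall ability more than LeetCode does and sit closer to business needs and fundamentals ☆295 Updated 5 months ago
- DeepSpeed Tutorial ☆97 Updated 10 months ago
- PyTorch model training code for single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across methods ☆102 Updated last year
- ☆79 Updated 10 months ago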
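- Chinese documentation for Hugging Face transformers ☆254 Updated last year
- ☆205 Updated last month
- Personal ChatGPT ☆373 Updated 6 months ago
- Modern AI for beginners ☆141 Updated 2 weeks ago
- Walks through the ChatGPT technical pipeline from scratch. ☆241 Updated 9 months ago
- DPO training for Tongyi Qianwen (Qwen) ☆49 Updated 9 months ago
- An ecosystem of large language models and multimodal models, mainly covering cross-modal search, speculative decoding, QAT quantization, multimodal quantization, ChatBot, and OCR ☆182 Updated this week
- Everything about LLM & AIGC ☆70 Updated last week
- LLM & RL ☆151 Updated this week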
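- An implementation of Transformer, BERT, GPT, and diffusion models for learning purposes ☆154 Updated 8 months ago
- Inference code for LLaMA models ☆121 Updated last year
- ☆164 Updated last year
- Interview questions and interview experiences from major tech companies, for programmers ☆138 Updated last month
- LLM-related content, including fundamentals, standard interview questions, interview experiences, and classic papers ☆139 Updated last year
- Notes on multimodal knowledge for large language model (LLM) algorithm (application) engineers ☆205 Updated last year
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge ☆120 Updated 7 months ago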