owenliang / pytorch-transformerLinks

pytorch复现transformer

☆80

Alternatives and similar repositories for pytorch-transformer

Users that are interested in pytorch-transformer are comparing it to the libraries listed below

Sorting:

owenliang / pytorch-diffusion
pytorch复现stable diffusion
☆181Updated 2 years ago
bbruceyuan / AI-Interview-Code
LLM大模型（重点）以及搜广推等 AI 算法中手写的面试题，（非 LeetCode），比如 Self-Attention, AUC等，一般比 LeetCode 更考察一个人的综合能力，又更贴近业务和基础知识一点
☆330Updated 7 months ago
liuzard / transformers_zh_docs
Huggingface transformers的中文文档
☆267Updated last year
TongTong313 / LLM-TT
童发发的大模型学习之旅
☆109Updated 3 weeks ago
chunhuizhang / pytorch_distribute_tutorials
pytorch distribute tutorials
☆143Updated last month
mindspore-courses / step_into_llm
MindSpore online courses: Step into LLM
☆475Updated 3 weeks ago
owenliang / mnist-vit
vision transformer on mnist dataset
☆34Updated last year
owenliang / mnist-dits
Diffusion Transformers (DiTs) trained on MNIST dataset
☆122Updated last year
yuanzhoulvpi2017 / vscode_debug_transformers
☆361Updated 5 months ago
chunhuizhang / personal_chatgpt
personal chatgpt
☆380Updated 7 months ago
RethinkFun / LLM
☆88Updated 11 months ago
qiufengqijun / mini_qwen
这是一个从头训练大语言模型的项目，包括预训练、微调和直接偏好优化，模型拥有1B参数，支持中英文。
☆536Updated 5 months ago
AI-Study-Han / Zero-Chatgpt
从0开始，将chatgpt的技术路线跑一遍。
☆247Updated 11 months ago
OvJat / DeepSpeedTutorial
DeepSpeed Tutorial
☆100Updated 11 months ago
wdndev / tiny-llm-zh
从零实现一个小参数量中文大语言模型。
☆758Updated 11 months ago
intro-llm / intro-llm-code
☆261Updated 3 months ago
zhanshijinwat / Steel-LLM
Train a 1B LLM with 1T tokens from scratch by personal
☆707Updated 3 months ago
KMnO4-zx / TinyRAG
TinyRAG
☆317Updated last month
wenjtop / transformer
Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型，经过这些年的大量的工业使用和论文验证，在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例，来解释Tran…
☆271Updated last year
lansinuote / Diffusion_From_Scratch
☆165Updated last year
owenliang / qwen2.5-0.5b-grpo
Qwen2.5 0.5B GRPO
☆58Updated 5 months ago
datawhalechina / awesome-compression
模型压缩的小白入门教程，PDF下载地址 https://github.com/datawhalechina/awesome-compression/releases
☆310Updated last month
owenliang / qwen-dpo
通义千问的DPO训练
☆51Updated 10 months ago
wdndev / mllm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师多模态相关知识
☆219Updated last year
owenliang / qwen-vllm
通义千问VLLM推理部署DEMO
☆592Updated last year
datawhalechina / llm-deploy
大模型/LLM推理和部署理论与实践
☆304Updated 3 weeks ago
LDLINGLINGLING / adan_application
一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR
☆185Updated last week
Tongjilibo / build_MiniLLM_from_scratch
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
☆461Updated 4 months ago
waylandzhang / DiT_from_scratch
一系列文生图模型概念讲解及代码实现
☆80Updated 9 months ago
liujunwen23 / MIRE
WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge
☆122Updated 8 months ago