owenliang / pytorch-transformer
pytorch复现transformer
☆77Updated last year
Alternatives and similar repositories for pytorch-transformer:
Users that are interested in pytorch-transformer are comparing it to the libraries listed below
- pytorch复现stable diffusion☆168Updated last year
- vision transformer on mnist dataset☆31Updated last year
- Diffusion Transformers (DiTs) trained on MNIST dataset☆107Updated last year
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆120Updated 5 months ago
- Qwen2.5 0.5B GRPO☆42Updated 2 months ago
- ☆71Updated 8 months ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆185Updated 11 months ago
- A simple deep learning framework inspired by Dezero and PyTorch☆29Updated 3 months ago
- modern AI for beginners☆128Updated 3 weeks ago
- DeepSpeed Tutorial☆96Updated 8 months ago
- ☆135Updated last week
- 通义千问的DPO训练☆47Updated 7 months ago
- pytorch distribute tutorials☆126Updated last week
- ☆318Updated 2 months ago
- LLM大模型(重点)以及搜广推等 AI 算法中手写的面试题,(非 LeetCode),比如 Self-Attention, AUC等,一般比 LeetCode 更考察一个人的综合能力,又更贴近业务和基础知识一点☆250Updated 4 months ago
- 一系列文生图模型概念讲解及代码实现☆64Updated 6 months ago
- 包含程序员面试大厂面试题和面试经验☆126Updated 4 months ago
- LLM Tokenizer with BPE algorithm☆31Updated last year
- personal chatgpt☆362Updated 4 months ago
- 一个很小很小的RAG系统☆215Updated last week
- 一些大语言模型和多模态模型的应用,主要包括小模型,Agent,跨模态搜索,OCR、RAG、ChatBot等等☆166Updated last week
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆98Updated last year
- 大模型/LLM推理和部署理论与实践☆250Updated last month
- 通义千问VLLM推理部署DEMO☆571Updated last year
- Huggingface transformers的中文文档☆236Updated last year
- Inference code for LLaMA models☆120Updated last year
- 这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。☆379Updated 2 months ago
- 通义千问 SFT试验☆69Updated last year
- 从0开始,将chatgpt的技术路线跑一遍。☆232Updated 8 months ago
- Learning LLM Implementaion and Theory for Practical Landing☆151Updated 4 months ago