iissy / transformer
Transformer source code implementation
☆27 Updated last year
Alternatives and similar repositories for transformer
Users who are interested in transformer are comparing it to the repositories listed below
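- Implementing a miniLLM from zero to one (hands-on LLM learning) ☆77 Updated last year
- Companion resources for the book 《大模型项目实战:多领域智能应用开发》 (Large Model Projects in Practice: Multi-Domain Intelligent Application Development) ☆220 Updated 2 months ago
- Notes on reproducing various LLM-related work from scratch ☆243 Updated 9 months ago
- Chinese documentation for Huggingface transformers ☆293 Updated 2 years ago
- The most concise PyTorch implementation of the Transformer model, with detailed comments ☆230 Updated 2 years ago
- Walking through the ChatGPT technical pipeline from scratch. ☆272 Updated last year
- DPO training for Tongyi Qianwen (Qwen) ☆61 Updated last year
- ☆136 Updated last year
- Tongyi Qianwen (Qwen) VLLM inference deployment demo ☆638 Updated last year
- Theory and practice of large model/LLM inference and deployment ☆374 Updated 6 months ago
- AI training courseware resources ☆148 Updated 2 months ago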
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM. ☆495 Updated 9 months ago
- ☆110 Updated 7 months ago
- Qwen3 Fine-tuning: Medical R1 Style Chat ☆277 Updated 8 months ago
- ☆223 Updated 4 years ago
- Train a 1B LLM with 1T tokens from scratch by personal ☆787 Updated 9 months ago
- ☆129 Updated last year
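- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the techniques, principles, and applications related to large models. ☆371 Updated last year
- Building a MiniLLM from 0 to 1 (pretrain+sft+dpo, work in progress) ☆526 Updated 10 months ago
- qwen ai agent ☆147 Updated last year
- ☆59 Updated 11 months ago
- Large language model fine-tuning: instruction fine-tuning for Qwen2VL, Qwen2, and GLM4 ☆600 Updated 8 months ago
- bilibili video course src code ☆424 Updated 2 years ago
- Explanations, extensions, and reproductions of the DeepSeek series of work. ☆700 Updated 10 months ago
- LLM Tokenizer with BPE algorithm ☆47 Updated last year
- Alibaba Tongyi Qianwen (Qwen-7B-Chat/Qwen-7B): fine-tuning/LORA/inference ☆136 Updated last year
- TinyRAG ☆413 Updated 7 months ago
- A Transformer Framework Based Translation Task ☆159 Updated 8 months ago
- RAG vector retrieval example ☆149 Updated last year
- An attempt to write an LLM from scratch, with reference to llama and nanogpt ☆68 Updated last year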