hans0809/MiniMind-in-Depth

hans0809 / MiniMind-in-Depth

A source-code walkthrough of the lightweight large language model MiniMind, covering the complete pipeline: tokenizer, RoPE, MoE, KV Cache, pretraining, SFT, LoRA, DPO, and more

☆1,117

Alternatives and similar repositories for MiniMind-in-Depth

Users who are interested in MiniMind-in-Depth are comparing it to the repositories listed below.

Tongyun1 / from-minimind-to-more
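🎓 An in-depth walkthrough of MiniMind, a project for training a large model from scratch, including but not limited to the architecture and algorithms used, plus LLM interview experience
☆1,010 · May 25, 2026 · Updated 2 months ago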
tomatoyuan / minimind-learn
Reproducing minimind 👉 minimind-v from scratch
☆371 · Dec 24, 2025 · Updated 7 months ago
jingyaogong / minimind
🧠 [Large Model] Train a 64M-parameter LLM from scratch in just 2h!
☆53,838 · Updated this week
jingyaogong / minimind-v
👀 [Large Model] Train a 65M-parameter visual multimodal VLM from scratch in just 2h!
☆8,365 · Jun 28, 2026 · Updated 3 weeks ago
joyehuang / minimind-notes
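🚀 [Building an LLM from Scratch] A minimal guide to the principles and practice of large-model training, including core code and controlled experiments for Transformer, Pretraining, and SFT. | A minimal, principle-first guide to understanding and building…
☆160 · Jun 4, 2026 · Updated last month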
bcefghj / learn-minimind
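📖 From zero background to passing the interview: 22 lessons to thoroughly understand large language models | Learn MiniMind: a systematic study of the full LLM training pipeline
☆458 · Apr 1, 2026 · Updated 3 months ago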
Nijikadesu / breakdown-minimind
Use interactive notebooks to break down the MiniMind code and learn from scratch.
☆155 · Jan 7, 2026 · Updated 6 months ago
Alic-Li / Mini_RWKV_7
Mini_RWKV_V7_LM Only 34.2M params (also have RWKV7s architecture [deep embedding]/[deep embedding attention) with Full Training code & da…
☆90 · Jan 26, 2026 · Updated 6 months ago
fzkun / minimind-ascend
A reproduction of MiniMind on the Ascend 910B, a fully Chinese-made accelerator card. 🚀🚀 [Large Model] 🌏 Train a 26M-parameter GPT from scratch in just 2h!
☆25 · Mar 2, 2026 · Updated 4 months ago
Wood-Q / MokioMind
Hand-code a large model in three hours for three yuan
☆542 · Mar 12, 2026 · Updated 4 months ago
datawhalechina / happy-llm
📚 Build a large model from scratch
☆32,338 · May 6, 2026 · Updated 2 months ago
shibing624 / MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large models, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.
☆5,656 · Jun 3, 2026 · Updated last month
yuandaxia2001 / HealthAI-2025
☆170 · Mar 18, 2026 · Updated 4 months ago
wdndev / llm_interview_note
Notes on knowledge and interview questions for large language model (LLM) algorithm (application) engineers
☆14,759 · Jun 14, 2026 · Updated last month
bcefghj / learn-minimind-multimodal
MiniMind-V multimodal interview study guide: 20 lessons + 278 interview questions + STAR interview scripts + Doraemon comics
☆133 · Apr 2, 2026 · Updated 3 months ago
Bader-CN / Note-for-LLM-Training
Notes on the basic workflow of a complete LLM training run (Tokenizer -> PreTraining -> SFT -> DPO -> GRPO)
☆27 · Feb 23, 2026 · Updated 5 months ago
datawhalechina / tiny-universe
"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe
☆4,979 · Feb 12, 2026 · Updated 5 months ago
jingyaogong / minimind-o
🎙️ [Large Model] A 0.1B Omni model trained from scratch, capable of listening, speaking, and seeing!
☆2,184 · Jun 28, 2026 · Updated 3 weeks ago
bcefghj / learn-cs336
CS336 interview-oriented study guide - Stanford Language Modeling from Scratch
☆124 · Apr 2, 2026 · Updated 3 months ago
datawhalechina / self-llm
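"A Guide to Using Open-Source Large Models": tutorials tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying open-source large language models (LLMs) and multimodal large models (MLLMs), from China and abroad, in a Linux environment
☆31,416 · Jul 15, 2026 · Updated last week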
datawhalechina / all-in-rag
🔍 Hands-on LLM application development, part one: a full-stack guide to RAG techniques. Read online: https://datawhalechina.github.io/all-in-rag/
☆9,781 · Updated this week
ckd0817 / LLM-Interview-Code
☆736 · Mar 26, 2026 · Updated 4 months ago
datawhalechina / hello-agents
📚 "Building Agents from Scratch": a from-scratch tutorial on agent principles and practice
☆68,564 · Jul 17, 2026 · Updated last week
luhengshiwo / LLMForEverybody
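LLM knowledge that everyone can understand; essential reading before LLM interviews in the spring/autumn campus recruiting seasons, so you can talk confidently with your interviewer
☆7,017 · May 31, 2026 · Updated last month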
qibin0506 / Cortex
Building a large model from scratch: a complete practice from pretraining to RLHF
☆2,679 · May 20, 2026 · Updated 2 months ago
bcefghj / learn-MedicalGPT
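🏥 From zero background to passing the interview: 20 lessons to thoroughly understand the full training pipeline of the MedicalGPT medical large model | PT/SFT/LoRA/RLHF/DPO/GRPO | 100+ frequently asked interview topics
☆189 · Apr 1, 2026 · Updated 3 months ago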
liguodongiot / llm-action
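This project shares technical principles and hands-on experience related to large models (LLM engineering and real-world deployment of LLM applications)
☆24,800 · Updated this week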
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,153 · Nov 13, 2025 · Updated 8 months ago
AkaliKong / MiniOneRec
Minimal reproduction of OneRec
☆1,711 · May 14, 2026 · Updated 2 months ago
qiufengqijun / mini_qwen
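A project for training a large language model from scratch, including pretraining, fine-tuning, and direct preference optimization; the model has 1B parameters and supports Chinese and English.
☆863 · Feb 18, 2025 · Updated last year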
Junvate / LLM-Algorithm-Intern-Guide
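🚀 Internship interview notes for LLM algorithm roles, class of 2026 | Includes analyses of the DeepSeek/Qwen technical reports, hand-coding PPO/RoPE/Transformer, RLHF essentials, and standard interview questions | Continuously updated...
☆615 · Mar 28, 2026 · Updated 3 months ago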
forXuyx / Cinego
🚀 Lightweight video 🎥 large model 🤖
☆23 · Apr 27, 2025 · Updated last year
datawhalechina / diy-llm
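🎓 A systematic course on building large language models | 🛠️ Covers pretraining data engineering, Tokenizer, Transformer, MoE, GPU programming (CUDA/Triton), distributed training, Scaling Laws, inference optimization, and alignment (SFT/RLHF/GRPO) | 🚀 6 progressive assignments + code-driv…
☆1,073 · Updated this week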
weiruihhh / cs336_note_and_hw
My notes and homework from studying cs336
☆1,018 · May 2, 2026 · Updated 2 months ago
hk011 / yanxi-paper-note
AI breaks down papers so that anyone can understand frontier research
☆18 · Jul 10, 2026 · Updated 2 weeks ago
shareAI-lab / learn-claude-code
Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1
☆72,198 · Jun 26, 2026 · Updated 3 weeks ago
wyf3 / llm_related
Reproductions of LLM-related algorithms and some study notes
☆3,466 · Jul 2, 2026 · Updated 3 weeks ago
GeeeekExplorer / nano-vllm
Nano vLLM
☆14,635 · Apr 26, 2026 · Updated 3 months ago
changyeyu / LLM-RL-Visualized
🌟 100+ original LLM / RL principle diagrams 📚, a major contribution from the author of "Large Model Algorithms"! 💥 (100+ LLM/RL Algorithm Maps)
☆4,702 · Jul 16, 2026 · Updated last week