joyehuang/minimind-notes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/joyehuang/minimind-notes)

joyehuang / minimind-notes

🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building LLMs from scratch.

☆158

Alternatives and similar repositories for minimind-notes

Users that are interested in minimind-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hans0809 / MiniMind-in-Depth
View on GitHub
轻量级大语言模型MiniMind的源码解读，包含tokenizer、RoPE、MoE、KV Cache、pretraining、SFT、LoRA、DPO等完整流程
☆1,117Jun 16, 2025Updated last year
Tongyun1 / from-minimind-to-more
View on GitHub
🎓从0开始训练一个大模型Minimind项目的超详细解析，包括但不限于用到的架构，算法，以及大模型面试经验
☆1,009May 25, 2026Updated 2 months ago
tomatoyuan / minimind-learn
View on GitHub
从零复现 minimind👉minimind-v
☆371Dec 24, 2025Updated 7 months ago
jingyaogong / minimind
View on GitHub
🧠「大模型」2小时完全从0训练64M的小参数LLM！Train a 64M-parameter LLM from scratch in just 2h!
☆53,816Updated this week
jingyaogong / minimind-v
View on GitHub
👀「大模型」2小时从0训练65M参数的视觉多模态VLM！Train a 65M-parameter VLM from scratch in just 2h!
☆8,361Jun 28, 2026Updated 3 weeks ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
listen0425 / Safety-Layers
View on GitHub
code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)
☆25Apr 26, 2025Updated last year
tzzp1224 / RepoReaper
View on GitHub
☆95May 4, 2026Updated 2 months ago
bcefghj / learn-MedicalGPT
View on GitHub
🏥 从零基础到面试通关：20节课彻底搞懂MedicalGPT医疗大模型训练全流程 | PT/SFT/LoRA/RLHF/DPO/GRPO | 100+面试高频考点
☆188Apr 1, 2026Updated 3 months ago
Hoar012 / TDC-Video
View on GitHub
Official implementation of TDC.
☆15Jul 22, 2025Updated last year
Wood-Q / MokioMind
View on GitHub
三元三小时手敲大模型
☆541Mar 12, 2026Updated 4 months ago
bcefghj / learn-minimind-multimodal
View on GitHub
MiniMind-V 多模态面试学习指南 - 20节课程 + 278道面试题 + STAR面试稿 + 哆啦A梦漫画
☆132Apr 2, 2026Updated 3 months ago
EthanLiu6 / LLM_knowledge
View on GitHub
- 【LLM面经】大模型实习面试指南。手撕代码、面经经验、思考题等。初学者学习ing......欢迎指正错误
☆35Nov 11, 2025Updated 8 months ago
linglingxiansen / SpatialSKy
View on GitHub
☆36Feb 26, 2026Updated 4 months ago
yewzijian / MultiReg
View on GitHub
Learning Iterative Robust Transformation Synchronization
☆15Nov 29, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
BRZ911 / Wrong-of-Thought
View on GitHub
[EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information
☆13Oct 1, 2024Updated last year
WarlockWendell / AggDet
View on GitHub
official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation
☆13Apr 15, 2024Updated 2 years ago
Escapist-coder / OpenVLA-Libero-Reproduction-Finetune
View on GitHub
☆21Mar 12, 2026Updated 4 months ago
Junvate / LLM-Algorithm-Intern-Guide
View on GitHub
🚀 2026届大模型算法岗实习面经 | 包含 DeepSeek/Qwen 技术报告解析、手撕 PPO/RoPE/Transformer、RLHF 核心与八股文 | 持续更新中...
☆615Mar 28, 2026Updated 3 months ago
VisionXLab / avi-math
View on GitHub
[ISPRS'25] Multimodal Mathematical Reasoning Embedded in Aerial Vehicle Imagery: Benchmarking, Analysis, and Exploration
☆18Jan 4, 2026Updated 6 months ago
tiaoyu1122 / TiaoYu-1
View on GitHub
For People! For Freedom!
☆151Aug 14, 2025Updated 11 months ago
fang503 / antflow
View on GitHub
AI agent platform enhanced with Agent OS architecture inspired by Claude Code, built on DeerFlow
☆62Apr 1, 2026Updated 3 months ago
pingponglabs / FaceAnime
View on GitHub
☆10Apr 22, 2021Updated 5 years ago
shekshaa / Symmetric-ICP
View on GitHub
Rigid-Body Mesh Registration with symmetrized ICP
☆20Jan 12, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
louisccc / KGppler
View on GitHub
☆12Apr 20, 2020Updated 6 years ago
JoseponLee / IntentQA
View on GitHub
Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.
☆26Nov 29, 2024Updated last year
ZHAOoops / AI-Notes
View on GitHub
Bilibili东川路第一可爱猫猫虫的AI笔记
☆274May 2, 2026Updated 2 months ago
tinyzqh / Algorithms_Note
View on GitHub
算法工程师技术栈学习笔记
☆15Aug 22, 2022Updated 3 years ago
datawhalechina / happy-llm
View on GitHub
📚 从零开始构建大模型
☆32,316May 6, 2026Updated 2 months ago
THUDM / SCALE-CUA
View on GitHub
Open-source framework for computer use agents: VeriGen verifiable task synthesis, online RL training (AgentRL), and OSWorld/ScienceBoard …
☆33Updated this week
Mr-Loevan / FAST
View on GitHub
[NeurIPS 2025 Spotlight] Fast-Slow Thinking GRPO for Large Vision-Language Model Reasoning
☆55Apr 16, 2026Updated 3 months ago
Lau-Jonathan / LLM-Agent-Interview-Guide
View on GitHub
🔥 大模型 & Agent 面试八股文完全指南 | LLM & Agent Interview Preparation Guide
☆587Feb 28, 2026Updated 4 months ago
wudu98 / autoGEMM
View on GitHub
☆15Dec 5, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
0324Lw / Deep-Reinforcement-Learning-Notes
View on GitHub
这是我的深度强化学习的学习笔记与总结
☆86Mar 18, 2026Updated 4 months ago
faithlumumba / 2025-tencent-advertising-algorithm-competition-finalist
View on GitHub
🎯 Build a winning recommendation system with this effective generative framework, advancing to the finals of the 2025 Tencent Advertisin…
☆27Updated this week
markwu7777 / markwu_compiler
View on GitHub
大连理工大学编译原理课程设计
☆10Jan 1, 2024Updated 2 years ago
ckd0817 / LLM-Interview-Code
View on GitHub
☆736Mar 26, 2026Updated 3 months ago
wdndev / llm_interview_note
View on GitHub
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
☆14,757Jun 14, 2026Updated last month
brown-palm / AntGPT
View on GitHub
Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
☆31Sep 23, 2024Updated last year
DeepReasoning / TECHS
View on GitHub
TECHS: Temporal Logical Graph Networks for Explainable Extrapolation Reasoning
☆10Jan 16, 2024Updated 2 years ago