轻量级大语言模型MiniMind的源码解读,包含tokenizer、RoPE、MoE、KV Cache、pretraining、SFT、LoRA、DPO等完整流程
☆784Jun 16, 2025Updated 9 months ago
Alternatives and similar repositories for MiniMind-in-Depth
Users that are interested in MiniMind-in-Depth are comparing it to the libraries listed below
Sorting:
- 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!☆41,349Feb 6, 2026Updated last month
- I love reinforcement learning.☆12Jan 15, 2025Updated last year
- 🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building…☆67Mar 5, 2026Updated last week
- The simplest Local Knowledge Base example based on Langchain and Chat-GLM☆13Jun 9, 2023Updated 2 years ago
- 🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆6,697Feb 4, 2026Updated last month
- 🚀 轻量视频🎥 大模型🤖☆21Apr 27, 2025Updated 10 months ago
- [ACM MM 2025] Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation☆49Oct 29, 2025Updated 4 months ago
- BUPT Joint Programme with QMUL☆21Dec 21, 2023Updated 2 years ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题☆13,086Apr 30, 2025Updated 10 months ago
- 解锁HuggingFace生态的百般用法☆98Dec 14, 2024Updated last year
- 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程☆28,880Mar 8, 2026Updated last week
- 北航《并行程序设计》Lab合集(竞速Rank1)☆31Feb 23, 2023Updated 3 years ago
- 📚 从零开始的大语言模型原理与实践教程☆27,303Mar 5, 2026Updated last week
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆23,522Updated this week
- ☆15Apr 23, 2025Updated 10 months ago
- This repository contains source code and a high-quality test dataset for "Automated Commit Message Generation with Large Language Models.…☆10Nov 6, 2025Updated 4 months ago
- AQIPython is a Python module that calculates the Air Quality Index (AQI) for various air pollutants based on different standards.☆10Mar 5, 2024Updated 2 years ago
- breast Cancer乳腺癌数据挖掘,python sklearn☆11Apr 13, 2019Updated 6 years ago
- ☆707Jan 12, 2026Updated 2 months ago
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆4,592Feb 12, 2026Updated last month
- All perimeter-x files deobfuscated (atleast from the versions i could find)☆17Mar 23, 2025Updated 11 months ago
- AI 学习之旅☆103Jul 24, 2025Updated 7 months ago
- An AVX Lifter for the Hex-Rays Decompiler + new instructions☆11Oct 14, 2022Updated 3 years ago
- a various table design method, you can design the style as you want.☆12Oct 7, 2018Updated 7 years ago
- Scraper for aqicn.org☆11Sep 4, 2018Updated 7 years ago
- [WACV 2025] Cross-Task Affinity Learning for Multitask Dense Scene Predictions☆11Jun 12, 2025Updated 9 months ago
- Free-Vortex Wake Modelling with Discrete Adjoint☆13Aug 25, 2022Updated 3 years ago
- okHttp+RxJava封装一个类似retrofit的框架☆10Sep 17, 2018Updated 7 years ago
- AeroTop: an efficient aerodynamic topology optimization framework☆12Apr 1, 2022Updated 3 years ago
- Implementation of GALS (GNSS-Augmented LiDAR SLAM)☆14Jul 5, 2022Updated 3 years ago
- Arche is a Greek word with primary senses "beginning". The repository defines a framework for technology mapping of emerging technologies…☆11May 15, 2020Updated 5 years ago
- A python wrapper for the QuantAQ RESTful API☆11Dec 24, 2025Updated 2 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- Ejemplo de cómo trabajar con gráficos en Kotlin☆12Sep 29, 2022Updated 3 years ago
- Android CameraX + OpenGL。虽然 CameraX 已经封装了大部分对相机操作,但想要基于 OpenGL 做一个可以自定义渲染流程的相机还是有不少东西需要处理,如 OpenGL 环境的封装,需要自定义的 SurfaceProvider,和图片视频捕获。 …☆11Nov 18, 2024Updated last year
- A distributed filesystem☆10Jan 31, 2017Updated 9 years ago
- A port of the RWKV v7 language model, implemented with the Burn deep learning framework☆14Jun 9, 2025Updated 9 months ago
- Write events for TensorBoard☆11Jun 27, 2024Updated last year
- Simple implementation of a custom parquet reader/writer☆11Aug 12, 2016Updated 9 years ago