🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building LLMs from scratch.
☆70Mar 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for minimind-notes
Users that are interested in minimind-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pretrain、Posttrain、RAG、Agent等大模型相关的基础项目合集☆32Dec 7, 2025Updated 3 months ago
- 引入腾讯游戏语音GCloudVoice,此sdk专门为游戏而集成(王者荣耀就是用这个),通过jni调用,可以完全使用其中的游戏语音功能,目前免费测试和使用,有兴趣的可以点击了解 https://www.qcloud.com/product/GVoice☆10Mar 29, 2017Updated 8 years ago
- This is a command line interface for the Rec Cloud Service (rec.ustc.edu.cn)☆15Oct 24, 2025Updated 5 months ago
- Official Repository of paper MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Pol…☆79Jan 26, 2026Updated last month
- Source code of our MM24 paper "Harmfully Manipulated Images Matter in Multimodal Misinformation Detection"☆18Aug 10, 2025Updated 7 months ago
- Talkmore with Opentypeless. Type with your voice. Anywhere. Talk - Recoding - Polish - Done!☆49Mar 12, 2026Updated last week
- Diff-SFCT: A Diffusion Model with Spatial-Frequency Cross Transformer for Medical Image Segmentation☆10Apr 15, 2024Updated last year
- ☆30Dec 14, 2025Updated 3 months ago
- ☆18Jun 16, 2025Updated 9 months ago
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)☆22Apr 26, 2025Updated 10 months ago
- A framework for evolving and testing question-answering datasets with various models.☆23Feb 28, 2024Updated 2 years ago
- 基于vuepress的静态个人简历☆11Aug 27, 2025Updated 6 months ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆23May 27, 2025Updated 9 months ago
- Implementation of MaNi: Maximizing Mutual Information for Nuclei Cross-Domain Unsupervised Segmentation☆12Jun 30, 2022Updated 3 years ago
- Code used for VLDB paper "The next 50 Years in Database Indexing or: The Case for Automatically Generated Index Structures"☆13Mar 31, 2022Updated 3 years ago
- Single Image Dehazing via Multi-scale Convolutional Neural Networks, ECCV 2016☆38Jan 21, 2018Updated 8 years ago
- 南京理工大学计算机考研复试上机题解☆14Jul 26, 2019Updated 6 years ago
- 北京邮电大学果园(国际学院)的资料库☆28Mar 5, 2026Updated 2 weeks ago
- 2023 徐云 算法基础 作业实验☆11Dec 9, 2023Updated 2 years ago
- Unsupervised fusion of misaligned PAT and MRI images via mutually reinforcing cross-modality image generation and registration☆15Oct 14, 2025Updated 5 months ago
- ☆21May 4, 2022Updated 3 years ago
- 能够在PC端SEU网上办事服务大厅的研究生素质讲座实现自动定时抢讲座,可以做到自动或者手动输入验证码,解放双手!☆21Nov 17, 2023Updated 2 years ago
- Asynchronous IO for C++20☆18Sep 26, 2023Updated 2 years ago
- ☆14Mar 27, 2023Updated 2 years ago
- Source code of the paper: Exploring Multi-View Pixel Contrast for General and Robust Image Forgery Localization, IEEE TIFS 2025.☆26Aug 8, 2025Updated 7 months ago
- ☆128Oct 11, 2025Updated 5 months ago
- ☆95Updated this week
- [WWW 2025] Code for Modality Interactive Mixture-of-Experts for Fake News Detection☆33Jun 25, 2025Updated 8 months ago
- 北京邮电大学生存指南,从沙河到本部,从入学到毕业的全程陪伴☆36Mar 17, 2026Updated last week
- 轻量级大语言模型MiniMind的源码解读,包含tokenizer、RoPE、MoE、KV Cache、pretraining、SFT、LoRA、DPO等完整流程☆833Jun 16, 2025Updated 9 months ago
- PyTorch impelementation for "Federated Recommendation via Hybrid Retrieval Augmented Generation".☆23Mar 8, 2024Updated 2 years ago
- Summary of PingCap tinykv camp. No codes presented.☆22May 9, 2023Updated 2 years ago
- Cell Graph Transformer for Nuclei Classification, AAAI 2024☆27Oct 8, 2024Updated last year
- documents for 深大飞跃手册☆36Oct 11, 2025Updated 5 months ago
- ☆24Dec 14, 2024Updated last year
- [NeurIPS 2025] 𝓡𝓣𝓥-𝓑𝓮𝓷𝓬𝓱: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video.☆32Jan 15, 2026Updated 2 months ago
- ECSO (Make MLLM safe without neither training nor any external models!) (https://arxiv.org/abs/2403.09572)☆35Nov 2, 2024Updated last year
- run ChatGLM2-6B in BM1684X☆48Mar 1, 2024Updated 2 years ago
- 微博情感分类数据集+爬虫+句嵌入+情感分类+作图☆26Dec 31, 2024Updated last year