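🚀 [Build an LLM from Scratch] A minimal guide to the principles and practice of LLM training, with core code and controlled experiments for Transformer, Pretraining, and SFT. | A minimal, principle-first guide to understanding and building LLMs from scratch.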
☆57 Feb 25, 2026 Updated last week
Alternatives and similar repositories for minimind-notes
Users that are interested in minimind-notes are comparing it to the libraries listed below
- This is a command line interface for the Rec Cloud Service (rec.ustc.edu.cn) ☆15 Oct 24, 2025 Updated 4 months ago
- Diff-SFCT: A Diffusion Model with Spatial-Frequency Cross Transformer for Medical Image Segmentation ☆10 Apr 15, 2024 Updated last year
- 2023 Xu Yun Algorithm Fundamentals coursework and labs ☆11 Dec 9, 2023 Updated 2 years ago
- ☆19 Jun 29, 2025 Updated 8 months ago
- ☆14 Mar 27, 2023 Updated 2 years ago
- Source code of our MM24 paper "Harmfully Manipulated Images Matter in Multimodal Misinformation Detection" ☆18 Aug 10, 2025 Updated 6 months ago
- ☆17 Jan 9, 2024 Updated 2 years ago
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025) ☆22 Apr 26, 2025 Updated 10 months ago
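- Automatically grabs seats at scheduled times for graduate quality-education lectures in the SEU online service hall (PC version); the captcha can be entered automatically or manually, freeing your hands! ☆21 Nov 17, 2023 Updated 2 years ago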
- Unsupervised fusion of misaligned PAT and MRI images via mutually reinforcing cross-modality image generation and registration ☆15 Oct 14, 2025 Updated 4 months ago
- A copy of the course page, including all the pages and information ☆17 Dec 5, 2024 Updated last year
- PointNu-Net Project ☆18 Dec 28, 2023 Updated 2 years ago
- [ACL 2024] PyTorch implementation for "Stealthy Attack on Large Language Model based Recommendation" ☆19 Jun 19, 2024 Updated last year
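- A survival guide to Beijing University of Posts and Telecommunications (BUPT), from the Shahe campus to the main campus, with you all the way from enrollment to graduation ☆33 Updated this week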
- ☆21 May 4, 2022 Updated 3 years ago
- Source code of the paper: Exploring Multi-View Pixel Contrast for General and Robust Image Forgery Localization, IEEE TIFS 2025. ☆25 Aug 8, 2025 Updated 6 months ago
- ☆24 Dec 14, 2024 Updated last year
- In this repository, I share some useful resources that you should know before pursuing your Master's or Ph.D. degree. ☆24 Jan 12, 2025 Updated last year
- TPAMI 2025: Spatial Frequency Modulation for Semantic Segmentation ☆44 Jan 28, 2026 Updated last month
- BUPT Joint Programme with QMUL ☆21 Dec 21, 2023 Updated 2 years ago
- Deep Hierarchical Video Compression ☆33 Jan 30, 2026 Updated last month
- Cell Graph Transformer for Nuclei Classification, AAAI 2024 ☆27 Oct 8, 2024 Updated last year
- [WWW 2025] Code for Modality Interactive Mixture-of-Experts for Fake News Detection ☆30 Jun 25, 2025 Updated 8 months ago
- ☆23 Jun 21, 2023 Updated 2 years ago
- A lightweight library that implements state-of-the-art few-shot learning algorithms. ☆24 Apr 18, 2021 Updated 4 years ago
- Training platform for End-to-End compression models, losses and metrics defined in Compressai ☆24 Nov 30, 2023 Updated 2 years ago
- A clone modeled on Dianping (大众点评) ☆37 Feb 23, 2025 Updated last year
- The Video Conferencing Dataset (VCD) to evaluate video codecs for video conferencing. ☆29 May 15, 2024 Updated last year
- [ICCV 2025] MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance ☆49 Sep 24, 2025 Updated 5 months ago
- A weekly AI paper digest, sourced mainly from HF's Daily Papers, with some other popular papers mixed in ☆35 Feb 15, 2026 Updated 2 weeks ago
- ☆31 Sep 12, 2023 Updated 2 years ago
- PyTorch re-implementation of Transformer-based Transform Coding ☆29 Dec 28, 2023 Updated 2 years ago
- [ICLR 2025] Official implementation for "SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanati… ☆43 Feb 11, 2025 Updated last year
- Introduction to Algorithms course taught by Xu Yun at the USTC School of Software Engineering ☆27 May 20, 2020 Updated 5 years ago
- CVPR18: Learning and Using the Arrow of Time ☆40 Feb 11, 2022 Updated 4 years ago
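- A medical question-answering system based on Qwen2 + SFT + DPO. The project trains with custom SFTTrainer/DPOTrainer/TRPOTrainer classes and also calls various knowledge-base tools (neo4j, milvus, LDA, etc.) to generate training data automatically. In addition, it uses vllm for inference… ☆61 Jan 4, 2026 Updated 2 months ago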
- ☆53 Dec 4, 2025 Updated 3 months ago
- The official repository of Real Text Manipulation (RTM) ☆43 Mar 18, 2025 Updated 11 months ago
- Learning a Deep Dual-level Network for Robust DeepFake Detection ☆33 Jun 13, 2022 Updated 3 years ago