🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building LLMs from scratch.
☆123Jun 4, 2026Updated last week
Alternatives and similar repositories for minimind-notes
Users that are interested in minimind-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models☆18Feb 27, 2025Updated last year
- This is a command line interface for the Rec Cloud Service (rec.ustc.edu.cn)☆15Oct 24, 2025Updated 7 months ago
- Official Pytorch implementation for DCVC-SDD: [Spatial Decomposition and Temporal Fusion Based Inter Prediction for Learned Video Compres…☆17Feb 3, 2025Updated last year
- Custom YOLOv4 for apple recognition (clean/damaged) on Alveo U280 accelerator card using Vitis AI framework.☆15Nov 1, 2021Updated 4 years ago
- Official repo for ECCV 2024 paper: Fast Encoding and Decoding for Implicit Video Representation☆16Jul 24, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆22Jan 16, 2025Updated last year
- 南京理工大学计算机软件与工程学院复试资源☆10Nov 16, 2019Updated 6 years ago
- ☆21Jun 29, 2025Updated 11 months ago
- 轻量级大语言模型MiniMind的源码解读,包含tokenizer、RoPE、MoE、KV Cache、pretraining、SFT、LoRA、DPO等完整流程☆1,061Jun 16, 2025Updated 11 months ago
- Diff-SFCT: A Diffusion Model with Spatial-Frequency Cross Transformer for Medical Image Segmentation☆10Apr 15, 2024Updated 2 years ago
- Natural Language-centered Inference Network for Multi-modal Fake News Detection☆12Sep 23, 2024Updated last year
- 🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diver…☆212May 19, 2026Updated 3 weeks ago
- Source code of our MM24 paper "Harmfully Manipulated Images Matter in Multimodal Misinformation Detection"☆19Aug 10, 2025Updated 10 months ago
- Implementation for Machine-Generated Text Localization (ACL 2024 Findings)☆14Jun 17, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network☆16Feb 11, 2025Updated last year
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆28Jan 16, 2026Updated 4 months ago
- ☆35Dec 14, 2025Updated 6 months ago
- 厦门大学信息学院 计算机图形学课程相关实验全纪录 OpenGL VS2019☆16Jun 15, 2022Updated 3 years ago
- 基于vuepress的静态个人简历☆11Mar 24, 2026Updated 2 months ago
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)☆24Apr 26, 2025Updated last year
- Implementation of MaNi: Maximizing Mutual Information for Nuclei Cross-Domain Unsupervised Segmentation☆12Jun 30, 2022Updated 3 years ago
- ☆17Jan 9, 2024Updated 2 years ago
- A framework for evolving and testing question-answering datasets with various models.☆26Feb 28, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- leeml-notes已更名为leedl-tutorial,请访问:https://github.com/datawhalechina/leedl-tutorial☆25May 27, 2024Updated 2 years ago
- 用于研读LevelDB源码时进行注释,持续更新☆12Feb 23, 2023Updated 3 years ago
- My website.☆38Mar 22, 2026Updated 2 months ago
- Deep Hierarchical Video Compression☆38May 21, 2026Updated 3 weeks ago
- 2023 徐云 算法基础 作业实验☆11Dec 9, 2023Updated 2 years ago
- 南京理工大学计算机考研复试上机题解☆14Jul 26, 2019Updated 6 years ago
- A unique_ptr implementation with small object optimization☆20Feb 8, 2026Updated 4 months ago
- Unsupervised fusion of misaligned PAT and MRI images via mutually reinforcing cross-modality image generation and registration☆16Updated this week
- ☆22May 4, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Training platform for End-to-End compression models, losses and metrics defined in Compressai☆26Nov 30, 2023Updated 2 years ago
- A lightweight library that implements state-of-the-art few-shot learning algorithms.☆25Apr 18, 2021Updated 5 years ago
- PointNu-Net Project☆19Dec 28, 2023Updated 2 years ago
- 北京邮电大学果园(国际学院)的资料库☆43Apr 16, 2026Updated last month
- Simple implementation of Retrieval-Augmented Generation System☆28Oct 24, 2024Updated last year
- TPAMI 2025: Spatial Frequency Modulation for Semantic Segmentation☆50Jan 28, 2026Updated 4 months ago
- PyTorch impelementation for "Federated Recommendation via Hybrid Retrieval Augmented Generation".☆23Mar 8, 2024Updated 2 years ago