🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building LLMs from scratch.
☆135Jun 4, 2026Updated last month
Alternatives and similar repositories for minimind-notes
Users that are interested in minimind-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 支持RTMDet、YOLOv8、YOLOX、Faster R-CNN等常见算法的ncnn部署☆13Mar 17, 2024Updated 2 years ago
- Towards Understanding Deep Learning Representations via Interactive Experimentation☆25May 5, 2017Updated 9 years ago
- This is a command line interface for the Rec Cloud Service (rec.ustc.edu.cn)☆15Oct 24, 2025Updated 8 months ago
- Official Pytorch implementation for DCVC-SDD: [Spatial Decomposition and Temporal Fusion Based Inter Prediction for Learned Video Compres…☆17Feb 3, 2025Updated last year
- Custom YOLOv4 for apple recognition (clean/damaged) on Alveo U280 accelerator card using Vitis AI framework.☆15Nov 1, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official repo for ECCV 2024 paper: Fast Encoding and Decoding for Implicit Video Representation☆16Jul 24, 2025Updated 11 months ago
- NewsPilot is an automated intelligence analysis system based on Large Language Models (LLM), designed to transform massive global news in…☆54Apr 30, 2026Updated 2 months ago
- 轻量级大语言模型MiniMind的源码解读,包含tokenizer、RoPE、MoE、KV Cache、pretraining、SFT、LoRA、DPO等完整流程☆1,091Jun 16, 2025Updated last year
- Official Repository of paper MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Pol…☆69Jan 26, 2026Updated 5 months ago
- Source code of our MM24 paper "Harmfully Manipulated Images Matter in Multimodal Misinformation Detection"