🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building LLMs from scratch.
☆102Apr 5, 2026Updated 3 weeks ago
Alternatives and similar repositories for minimind-notes
Users that are interested in minimind-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pretrain、Posttrain、RAG、Agent等大模型相关的基础项目合集☆39Dec 7, 2025Updated 4 months ago
- ☆93Updated this week
- This is a command line interface for the Rec Cloud Service (rec.ustc.edu.cn)☆15Oct 24, 2025Updated 6 months ago
- Custom YOLOv4 for apple recognition (clean/damaged) on Alveo U280 accelerator card using Vitis AI framework.☆15Nov 1, 2021Updated 4 years ago
- 南京理工大学计算机软件与工程学院复试资源☆10Nov 16, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆21Jun 29, 2025Updated 10 months ago
- Official Repository of paper MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Pol…☆79Jan 26, 2026Updated 3 months ago
- Diff-SFCT: A Diffusion Model with Spatial-Frequency Cross Transformer for Medical Image Segmentation☆10Apr 15, 2024Updated 2 years ago
- Source code of our MM24 paper "Harmfully Manipulated Images Matter in Multimodal Misinformation Detection"☆19Aug 10, 2025Updated 8 months ago
- AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network☆16Feb 11, 2025Updated last year
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆27Jan 16, 2026Updated 3 months ago
- [AAAI 2025] Official code for paper: DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image S…☆18Jun 16, 2025Updated 10 months ago
- Focused Papers, Delivered Simply :)☆55Dec 25, 2025Updated 4 months ago
- 厦门大学信息学院 计算机图形学课程相关实验全纪录 OpenGL VS2019☆16Jun 15, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆32Dec 14, 2025Updated 4 months ago
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)☆24Apr 26, 2025Updated last year
- Implementation of MaNi: Maximizing Mutual Information for Nuclei Cross-Domain Unsupervised Segmentation☆12Jun 30, 2022Updated 3 years ago
- A framework for evolving and testing question-answering datasets with various models.☆24Feb 28, 2024Updated 2 years ago
- Code used for VLDB paper "The next 50 Years in Database Indexing or: The Case for Automatically Generated Index Structures"☆14Mar 31, 2022Updated 4 years ago
- 用于研读LevelDB源码时进行注释,持续更新☆12Feb 23, 2023Updated 3 years ago
- Repo for collaboration on OSS agentic code search☆56Updated this week
- 在verl上做reward的定制开发☆151May 22, 2025Updated 11 months ago
- ☆24Jun 21, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 南京理工大学计算机考研复试上机题解☆14Jul 26, 2019Updated 6 years ago
- 2023 徐云 算法基础 作业实验☆11Dec 9, 2023Updated 2 years ago
- 北京邮电大学果园(国际学院)的资料库☆37Apr 16, 2026Updated 2 weeks ago
- Unsupervised fusion of misaligned PAT and MRI images via mutually reinforcing cross-modality image generation and registration☆16Oct 14, 2025Updated 6 months ago
- ☆21May 4, 2022Updated 4 years ago
- [ACL 2024] PyTorch implementation for "Stealthy Attack on Large Language Model based Recommendation"☆20Jun 19, 2024Updated last year
- Asynchronous IO for C++20☆18Sep 26, 2023Updated 2 years ago
- Training platform for End-to-End compression models, losses and metrics defined in Compressai☆26Nov 30, 2023Updated 2 years ago
- Code repository for the ECAI 2025 paper: Diffusion Noise Feature: Accurate and Fast Generated Image Detection.☆25Jan 28, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A lightweight library that implements state-of-the-art few-shot learning algorithms.☆25Apr 18, 2021Updated 5 years ago
- CVPR18: Learning and Using the Arrow of Time☆40Feb 11, 2022Updated 4 years ago
- PointNu-Net Project☆19Dec 28, 2023Updated 2 years ago
- Simple implementation of Retrieval-Augmented Generation System☆29Oct 24, 2024Updated last year
- [WWW 2025] Code for Modality Interactive Mixture-of-Experts for Fake News Detection☆37Jun 25, 2025Updated 10 months ago
- A curated list of works related to Misinformation Video Detection, as a companion material for an ACM Multimedia 2023 survey☆132Sep 22, 2025Updated 7 months ago
- 厦门大学数字媒体技术本科资料收集中。Collecting studying materials for DMT in Xiamen University.☆35Feb 10, 2026Updated 2 months ago