🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building LLMs from scratch.
☆87Apr 5, 2026Updated last week
Alternatives and similar repositories for minimind-notes
Users that are interested in minimind-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pretrain、Posttrain、RAG、Agent等大模型相关的基础项目合集☆37Dec 7, 2025Updated 4 months ago
- ☆91Updated this week
- An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models☆15Feb 27, 2025Updated last year
- 南京理工大学计算机软件与工程学院复试资源☆10Nov 16, 2019Updated 6 years ago
- Source code of our MM24 paper "Harmfully Manipulated Images Matter in Multimodal Misinformation Detection"☆18Aug 10, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official Repository of paper MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Pol…☆79Jan 26, 2026Updated 2 months ago
- AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network☆16Feb 11, 2025Updated last year
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆27Jan 16, 2026Updated 2 months ago
- [AAAI 2025] Official code for paper: DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image S…☆18Jun 16, 2025Updated 9 months ago
- Focused Papers, Delivered Simply :)☆54Dec 25, 2025Updated 3 months ago
- A framework for evolving and testing question-answering datasets with various models.☆23Feb 28, 2024Updated 2 years ago
- Code used for VLDB paper "The next 50 Years in Database Indexing or: The Case for Automatically Generated Index Structures"☆14Mar 31, 2022Updated 4 years ago
- leeml-notes已更名为leedl-tutorial,请访问:https://github.com/datawhalechina/leedl-tutorial☆25May 27, 2024Updated last year
- 用于研读LevelDB源码时进行注释,持续更新☆12Feb 23, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 南京理工大学计算机考研复试上机题解☆14Jul 26, 2019Updated 6 years ago
- 2023 徐云 算法基 础 作业实验☆11Dec 9, 2023Updated 2 years ago
- Unsupervised fusion of misaligned PAT and MRI images via mutually reinforcing cross-modality image generation and registration☆15Oct 14, 2025Updated 6 months ago
- Source code of the paper: Exploring Multi-View Pixel Contrast for General and Robust Image Forgery Localization, IEEE TIFS 2025.☆27Aug 8, 2025Updated 8 months ago
- Code repository for the ECAI 2025 paper: Diffusion Noise Feature: Accurate and Fast Generated Image Detection.☆25Jan 28, 2026Updated 2 months ago
- A lightweight library that implements state-of-the-art few-shot learning algorithms.☆25Apr 18, 2021Updated 4 years ago
- PointNu-Net Project☆19Dec 28, 2023Updated 2 years ago
- Simple implementation of Retrieval-Augmented Generation System☆29Oct 24, 2024Updated last year
- TPAMI 2025: Spatial Frequency Modulation for Semantic Segmentation☆47Jan 28, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICCV 2025] MultiverSeg: Scalable Interactive Segmentation of Biomedical Imaging Datasets with In-Context Guidance☆51Sep 24, 2025Updated 6 months ago
- A curated list of works related to Misinformation Video Detection, as a companion material for an ACM Multimedia 2023 survey☆132Sep 22, 2025Updated 6 months ago
- In this repository, I share some useful resources that you should know before pursuing your Master's or Ph.D. degree.☆24Jan 12, 2025Updated last year
- 北京邮电大学生存指南,从沙河到本部,从入学到毕业的全程陪伴☆36Mar 17, 2026Updated 3 weeks ago
- documents for 深大飞跃手册☆36Oct 11, 2025Updated 6 months ago
- ☆116Updated this week
- LLM-Check: Investigating Detection of Hallucinations in Large Language Models (NeurIPS 2024)☆38Dec 8, 2024Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Aug 27, 2023Updated 2 years ago
- 微博情感分类数据集+爬虫+句嵌入+情感分类+作图☆27Dec 31, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆20Apr 11, 2022Updated 4 years ago
- Fact-checking system for textual and visual inputs.☆51Feb 24, 2026Updated last month
- Emotional Analysis of Comments on Commodities, based on word2vec+LSTM☆20Apr 20, 2020Updated 5 years ago
- [NeurIPS 2025] 𝓡𝓣𝓥-𝓑𝓮𝓷𝓬𝓱: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video.☆33Jan 15, 2026Updated 2 months ago
- 小清新风个人博客,springboot倾情打造 https://blog.listeningrain.cn☆21Nov 16, 2022Updated 3 years ago
- Gym-Anything: Turn any Software into an Agent Environment☆140Updated this week
- 仿照大众点评☆41Feb 23, 2025Updated last year