🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A minimal, principle-first guide to understanding and building LLMs from scratch.
☆114May 8, 2026Updated 2 weeks ago
Alternatives and similar repositories for minimind-notes
Users that are interested in minimind-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repository for our paper "Doctor-R1: Mastering Clinical Inquiry with Experiential Agentic Reinforcement Learning" pu…☆35Apr 11, 2026Updated last month
- 哈尔滨工业大学计算机课程资料与实验☆23Apr 3, 2024Updated 2 years ago
- Custom YOLOv4 for apple recognition (clean/damaged) on Alveo U280 accelerator card using Vitis AI framework.☆15Nov 1, 2021Updated 4 years ago
- Official repo for ECCV 2024 paper: Fast Encoding and Decoding for Implicit Video Representation☆16Jul 24, 2025Updated 10 months ago
- 南京理工大学计算机软件与工程学院复试资源☆10Nov 16, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆21Jun 29, 2025Updated 10 months ago
- Official Repository of paper MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Pol…☆74Jan 26, 2026Updated 3 months ago
- Diff-SFCT: A Diffusion Model with Spatial-Frequency Cross Transformer for Medical Image Segmentation☆10Apr 15, 2024Updated 2 years ago
- Source code of our MM24 paper "Harmfully Manipulated Images Matter in Multimodal Misinformation Detection"☆19Aug 10, 2025Updated 9 months ago
- AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network☆16Feb 11, 2025Updated last year
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆27Jan 16, 2026Updated 4 months ago
- Focused Papers, Delivered Simply :)☆55Dec 25, 2025Updated 5 months ago
- ☆32Dec 14, 2025Updated 5 months ago
- 厦门大学信息学院 计算机图形学课程相关实验全纪录 OpenGL VS2019☆16Jun 15, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 基于vuepress的静态个人简历☆11Mar 24, 2026Updated 2 months ago
- Implementation of MaNi: Maximizing Mutual Information for Nuclei Cross-Domain Unsupervised Segmentation☆12Jun 30, 2022Updated 3 years ago
- Code used for VLDB paper "The next 50 Years in Database Indexing or: The Case for Automatically Generated Index Structures"☆14Mar 31, 2022Updated 4 years ago
- leeml-notes已更名为leedl-tutorial,请访问:https://github.com/datawhalechina/leedl-tutorial☆25May 27, 2024Updated last year
- 2023 徐云 算法基础 作业实验☆11Dec 9, 2023Updated 2 years ago
- ☆24Jun 21, 2023Updated 2 years ago
- 南京理工大学计算机考研复试上机题解☆14Jul 26, 2019Updated 6 years ago
- A unique_ptr implementation with small object optimization☆20Feb 8, 2026Updated 3 months ago
- Unsupervised fusion of misaligned PAT and MRI images via mutually reinforcing cross-modality image generation and registration☆16Oct 14, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆22May 4, 2022Updated 4 years ago
- The Video Conferencing Dataset (VCD) to evaluate video codecs for video conferencing.☆33May 15, 2024Updated 2 years ago
- [ACL 2024] PyTorch implementation for "Stealthy Attack on Large Language Model based Recommendation"☆21Jun 19, 2024Updated last year
- Asynchronous IO for C++20☆18Sep 26, 2023Updated 2 years ago
- Training platform for End-to-End compression models, losses and metrics defined in Compressai☆26Nov 30, 2023Updated 2 years ago
- Source code of the paper: Exploring Multi-View Pixel Contrast for General and Robust Image Forgery Localization, IEEE TIFS 2025.☆28Aug 8, 2025Updated 9 months ago
- Code repository for the ECAI 2025 paper: Diffusion Noise Feature: Accurate and Fast Generated Image Detection.☆25Jan 28, 2026Updated 3 months ago
- A lightweight library that implements state-of-the-art few-shot learning algorithms.☆25Apr 18, 2021Updated 5 years ago
- PointNu-Net Project☆19Dec 28, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A copy of the course page, including all the pages and information☆17Dec 5, 2024Updated last year
- Simple implementation of Retrieval-Augmented Generation System☆28Oct 24, 2024Updated last year
- TPAMI 2025: Spatial Frequency Modulation for Semantic Segmentation☆48Jan 28, 2026Updated 3 months ago
- [WWW 2025] Code for Modality Interactive Mixture-of-Experts for Fake News Detection☆39Jun 25, 2025Updated 10 months ago
- PyTorch impelementation for "Federated Recommendation via Hybrid Retrieval Augmented Generation".☆23Mar 8, 2024Updated 2 years ago
- Summary of PingCap tinykv camp. No codes presented.☆22May 9, 2023Updated 3 years ago
- In this repository, I share some useful resources that you should know before pursuing your Master's or Ph.D. degree.☆25Jan 12, 2025Updated last year