jingyaogong / minimind-v
🚀 Train a 26M-parameter visual multimodal VLM from scratch in just 1 hour! 🌏
☆5,588Updated this week
Alternatives and similar repositories for minimind-v
Users that are interested in minimind-v are comparing it to the libraries listed below
- 🚀🚀 Train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏☆35,166Updated last week
- A White-Box Guide to Building Large Models: Tiny-Universe, built entirely by hand☆4,103Updated last week
- Reproductions of LLM-related algorithms, plus some study notes☆2,648Updated this week
- ⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …☆3,207Updated this week
- Source-code walkthrough of the lightweight large language model MiniMind, covering the full pipeline: tokenizer, RoPE, MoE, KV Cache, pretraining, SFT, LoRA, DPO, and more☆469Updated 5 months ago
- From a nobody to a large language model (LLM) hero. Stay tuned for more!☆1,860Updated 2 weeks ago
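- Build large language models from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to understand how large models work in depth☆3,790Updated last year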
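- LLM knowledge sharing that everyone can understand; a must-read before spring/autumn campus-recruitment LLM interviews, so you can talk confidently with your interviewer☆4,812Updated last month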
- Large model fundamentals: learn the basics of large models in one article☆6,245Updated 9 months ago
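- A Guide to Using Open-Source Large Models: tutorials tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment☆26,411Updated this week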
- Large Language Models (《大语言模型》). Authors: Xin Zhao, Junyi Li, Kun Zhou, Tianyi Tang, Ji-Rong Wen☆4,130Updated 3 months ago
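- A step-by-step guide to getting started with AI/LLMs, with tutorials and demo code that take you from APIs to local large model deployment and fine-tuning. Code files come with online Kaggle or Colab versions, so you can learn even without a GPU. The project also includes a small code playground 🎡 where you can experiment with some fun AI scripts. It also includes Hung-yi Lee…☆3,229Updated 2 months ago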
- Use interactive notebook to break down MiniMind code and learn from scratch.☆115Updated 8 months ago
- 🌟 100+ original LLM / RL principle diagrams 📚, presented by the author of 《大模型算法》 (Large Model Algorithms)! 💥 (100+ LLM/RL Algorithm Maps)☆1,932Updated last week
- ☆13,126Updated 10 months ago
- Chinese translation of Hands-On-Large-Language-Models (hands-on-llms): learn large models hands-on☆1,731Updated last month
- 🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique styl…☆15,871Updated last week
- ☆1,185Updated 2 months ago
- Implement a Chinese version of llama3 from scratch☆989Updated last year
- 🧑🚀 Summary of the world's best LLM resources (speech and video generation, Agent, coding assistance, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).☆6,912Updated this week
- Building a MoE large model as an individual: a complete walkthrough from pretraining to DPO☆1,938Updated this week
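- This project shares the technical principles behind large models along with hands-on experience (large model engineering, putting large model applications into production)☆22,088Updated last week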
- Practice to LLM.☆2,056Updated 2 weeks ago
- Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.☆2,146Updated this week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (…☆11,418Updated this week
- Hands-on Ollama: deploy large models on a CPU. Read online: https://datawhalechina.github.io/handy-ollama/☆2,012Updated 3 weeks ago
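- Build a Large Language Model (From Scratch) is an e-book that explores the principles and implementation of large language models in depth, suited to learners who want a thorough understanding of the architecture, training process, and application development of large models such as GPT. To give more Chinese readers access to this highly valuable textbook, I decided to translate it into Chinese, and…☆2,705Updated 3 months ago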
- A tutorial on large model application development for beginner developers. Read online: https://datawhalechina.github.io/llm-universe/☆10,974Updated 2 months ago
- Building large language models that understand social nuance | Tutorials covering prompt engineering, RAG, Agent, and LLM fine-tuning☆1,620Updated 7 months ago
- 📚 A tutorial on large language model principles and practice, starting from scratch☆22,131Updated 3 weeks ago