jingyaogong / minimind
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
☆11,363Updated this week
Alternatives and similar repositories for minimind:
Users that are interested in minimind are comparing it to the libraries listed below
- 🚀 「大模型」3小时从0训练27M参数的视觉多模态VLM!🌏 Train a 27M-parameter VLM from scratch in just 3 hours!☆1,126Updated last week
- 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程☆12,758Updated last week
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆10,824Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆18,531Updated this week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,852Updated 3 weeks ago
- ☆7,223Updated last month
- Question and Answer based on Anything.☆12,543Updated 3 months ago
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆31,152Updated this week
- 整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。☆18,313Updated 5 months ago
- 💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, Qwen2, OpenAI,…☆13,721Updated this week
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆5,891Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆6,494Updated this week
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆2,998Updated 9 months ago
- Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 15…☆5,674Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆41,097Updated this week
- 本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/☆5,869Updated this week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆10,875Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆6,914Updated last week
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆2,238Updated last week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆21,511Updated this week
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆2,294Updated 6 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,030Updated last month
- Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用☆14,434Updated 5 months ago
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆14,362Updated last week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆38,028Updated this week
- Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)☆4,127Updated 5 months ago
- SOTA Open Source TTS☆19,362Updated this week
- GLM-4-Voice | 端到端中英语音对话模型☆2,669Updated 2 months ago
- LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key …☆22,691Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆16,936Updated 2 weeks ago