OpenBMB / MiniCPM-CookBookLinks
This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achieving exceptional performance on the edge.
☆295Updated 5 months ago
Alternatives and similar repositories for MiniCPM-CookBook
Users that are interested in MiniCPM-CookBook are comparing it to the libraries listed below
Sorting:
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆400Updated 8 months ago
- ☆241Updated 10 months ago
- ☆339Updated 2 months ago
- a toolkit on knowledge distillation for large language models☆223Updated this week
- LLM101n: Let's build a Storyteller 中文版☆136Updated last year
- GLM Series Edge Models☆156Updated 6 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆621Updated 6 months ago
- 一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR☆194Updated 4 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆77Updated last year
- ☆755Updated this week
- ☆170Updated last year
- Train a 1B LLM with 1T tokens from scratch by personal☆772Updated 7 months ago
- A LLM-based Agent that predict its tasks proactively.☆454Updated 4 months ago
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆190Updated last year
- A small open source 3D agent simulator based on LLM.☆67Updated last year
- LLaMA Factory Document☆159Updated 2 weeks ago
- Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool☆597Updated this week
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆419Updated 2 months ago
- ☆235Updated last year
- Alpaca Chinese Dataset -- 中文指令微调数据集☆217Updated last year
- Mixture-of-Experts (MoE) Language Model☆192Updated last year
- 顾名思义:手搓的RAG☆130Updated last year
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆465Updated 3 months ago
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆238Updated last month
- ☆105Updated 9 months ago
- 基于ReAct手搓一个Agent Demo☆159Updated 5 months ago
- DeepSeek 系列工作解读、扩展和复现。☆691Updated 8 months ago
- ☆233Updated last year
- 从0开始,将chatgpt的技术路线跑一遍。☆268Updated last year
- 基于《西游记》原文、白话文、ChatGPT生成数据制作的,以InternLM2微调的角色扮演多LLM聊天室。 本项目将介绍关于角色扮演类 LLM 的一切,从数据获取、数据处 理,到使用 XTuner 微调并部署至 OpenXLab,再到使用 LMDeploy 部署,以 op…☆106Updated last year