OpenBMB / MiniCPM-CookBook
This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. The series, nicknamed “面壁小钢炮” (roughly, "ModelBest's little steel cannon"), focuses on achieving exceptional performance on edge devices.
☆247 Updated 7 months ago
Alternatives and similar repositories for MiniCPM-CookBook
Users who are interested in MiniCPM-CookBook are comparing it to the repositories listed below.
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆339 Updated 2 months ago
- ☆233 Updated 4 months ago
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation ☆199 Updated last week
- ☆714 Updated 3 weeks ago
- MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval ☆187 Updated last month
- ☆310 Updated 6 months ago
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents ☆496 Updated last week
- Applications of large language models and multimodal models, mainly covering small models, agents, cross-modal search, OCR, RAG, chatbots, and more ☆179 Updated last week
- Alpaca Chinese Dataset: a Chinese instruction fine-tuning dataset ☆206 Updated 8 months ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce… ☆236 Updated last week
- A visualization tool for deeper understanding and easier debugging of RLHF training. ☆213 Updated 4 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World ☆265 Updated last month
- Train a 1B LLM with 1T tokens from scratch as an individual ☆679 Updated last month
- GLM Series Edge Models ☆142 Updated last week
- Use free LLM APIs together with your private-domain data to generate SFT training data (entirely free of charge); supports the training data formats of tools such as llamafactory. Synthetic data ☆167 Updated 6 months ago
- ☆228 Updated last year
- Agentic RAG R1 Framework via Reinforcement Learning ☆215 Updated 3 weeks ago
- An LLM-based agent that predicts its tasks proactively. ☆378 Updated last month
- As the name suggests: a hand-built RAG ☆124 Updated last year
- FlexRAG: A RAG Framework for Information Retrieval and Generation. ☆172 Updated this week
- Train a Language Model with GRPO to create a schedule from a list of events and priorities ☆204 Updated last month
- ☆222 Updated last year
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities. ☆38 Updated 6 months ago
- Collect every awesome work about r1! ☆386 Updated last month
- This is a repository used by individuals to experiment with and reproduce the pre-training process of LLMs. ☆441 Updated last month
- Train a LLaVA model with better Chinese support, and open-source the training code and data. ☆61 Updated 9 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆561 Updated 3 weeks ago
- Mixture-of-Experts (MoE) Language Model ☆189 Updated 9 months ago
- Minimal-cost training of a 0.5B R1-Zero ☆742 Updated last month
- Notes on reproducing LLM-related work from scratch ☆203 Updated last month