OpenBMB / MiniCPMLinks
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
☆8,489Updated 3 months ago
Alternatives and similar repositories for MiniCPM
Users that are interested in MiniCPM are comparing it to the libraries listed below
Sorting:
- MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone☆22,625Updated 3 months ago
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI…☆10,950Updated this week
- open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…☆3,510Updated last year
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,143Updated 2 months ago
- Mobile-Agent: The Powerful GUI Agent Family☆6,976Updated last month
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,159Updated last week
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆5,051Updated this week
- Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)☆2,763Updated last year
- UFO³: Weaving the Digital Agent Galaxy☆7,948Updated last week
- 百亿参数的中英文双语基座大模型☆2,412Updated 2 years ago
- Align Anything: Training All-modality Model with Feedback☆4,620Updated last month
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆7,040Updated 6 months ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆6,551Updated this week
- A series of large language models trained from scratch by developers @01-ai☆7,842Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (…☆12,112Updated this week
- C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)☆2,968Updated last year
- An Autonomous LLM Agent for Complex Task Solving☆8,484Updated last year
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,584Updated 2 months ago
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,289Updated last month
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,890Updated last year
- Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sour…☆1,464Updated 10 months ago
- Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)☆1,941Updated 2 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆6,478Updated last year
- 🚀 The best real-time interactive AI avatar(digital human) with on-premise deployment and <1.5 s latency.☆7,737Updated 2 weeks ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,121Updated last year
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,521Updated this week
- 中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3☆1,959Updated last year
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.☆5,474Updated 7 months ago
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,427Updated 10 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆12,920Updated 3 months ago