OpenBMB / MiniCPM-oLinks
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
☆19,471Updated 2 weeks ago
Alternatives and similar repositories for MiniCPM-o
Users that are interested in MiniCPM-o are comparing it to the libraries listed below
Sorting:
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.☆7,370Updated 6 months ago
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆10,709Updated 2 weeks ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆8,174Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆48,531Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆8,967Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆25,439Updated last week
- Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥☆39,558Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆50,485Updated last week
- "LightRAG: Simple and Fast Retrieval-Augmented Generation"☆16,850Updated this week
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,938Updated 9 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆25,396Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆18,358Updated last month
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆21,673Updated last week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,368Updated 4 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆14,667Updated this week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-…☆7,769Updated this week
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆6,912Updated 3 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,473Updated last week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆19,824Updated 2 months ago
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆6,592Updated last month
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming☆55,898Updated 2 weeks ago
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI…☆8,588Updated last week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆38,945Updated this week
- Question and Answer based on Anything.☆13,186Updated 2 months ago
- A simple screen parsing tool towards pure vision based GUI agent☆22,258Updated 2 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆6,427Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,766Updated 3 weeks ago
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.☆16,619Updated this week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆4,568Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,174Updated this week