OpenBMB / MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
☆13,445Updated this week
Alternatives and similar repositories for MiniCPM-o:
Users that are interested in MiniCPM-o are comparing it to the libraries listed below
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.☆7,066Updated 2 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆6,793Updated 3 weeks ago
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆11,800Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆15,373Updated last month
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,320Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆5,961Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆5,719Updated last week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆38,227Updated this week
- "LightRAG: Simple and Fast Retrieval-Augmented Generation"☆12,800Updated this week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,035Updated 3 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆6,576Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆33,809Updated this week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆4,158Updated this week
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆6,675Updated this week
- Mobile-Agent: The Powerful Mobile Device Operation Assistant Family☆3,180Updated 3 months ago
- Retrieval and Retrieval-augmented LLMs☆8,237Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆21,705Updated this week
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,204Updated 4 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆16,235Updated this week
- The Memory layer for your AI apps☆23,953Updated this week
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI…☆7,497Updated this week
- Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆4,207Updated this week
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆5,730Updated 2 weeks ago
- A series of large language models trained from scratch by developers @01-ai☆7,784Updated last month
- Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory☆20,611Updated this week
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,315Updated 5 months ago
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆18,641Updated this week
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,119Updated 4 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,197Updated this week
- High-speed Large Language Model Serving on PCs with Consumer-grade GPUs☆8,050Updated 4 months ago