A Toolkit for Running On-device Large Language Models (LLMs) in APP
☆83Jul 4, 2024Updated last year
Alternatives and similar repositories for MobileCPM
Users that are interested in MobileCPM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆308Jul 1, 2025Updated 10 months ago
- AI Emoji Argue Agent 🚀 基于LangChain的开源表情包斗图Agent☆29May 30, 2024Updated last year
- ☆10Mar 13, 2023Updated 3 years ago
- ☆323Sep 18, 2024Updated last year
- GLM Series Edge Models☆163Jun 12, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆229Apr 2, 2025Updated last year
- An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset☆28Jan 19, 2025Updated last year
- ☆33Jul 15, 2025Updated 10 months ago
- ☆47Jun 11, 2025Updated 11 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆189Mar 11, 2025Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆79Dec 8, 2025Updated 5 months ago
- Our 2nd-gen LMM☆34May 22, 2024Updated 2 years ago
- The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"☆22Apr 22, 2025Updated last year
- ☆18Dec 7, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025☆107Mar 14, 2025Updated last year
- Official implementation of paper: LiNo: Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Serie…☆18Dec 19, 2025Updated 5 months ago
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".☆24Oct 28, 2024Updated last year
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- ☆35Oct 9, 2025Updated 7 months ago
- Online Preference Alignment for Language Models via Count-based Exploration☆18Jan 14, 2025Updated last year
- A local search system implementation using Elasticsearch for Wikipedia data indexing and retrieval.☆14May 17, 2025Updated last year
- ☆83Apr 18, 2024Updated 2 years ago
- CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge tec…☆240Jan 14, 2026Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Source code for the paper "Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data"☆20Feb 24, 2024Updated 2 years ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Jun 28, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆30Aug 9, 2025Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆258Oct 30, 2024Updated last year
- A UI interface for Gemini | 快速与 Gemini 对话☆13Dec 17, 2023Updated 2 years ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆309Oct 18, 2024Updated last year
- ncnn android paddle ocr v5☆172Jan 14, 2026Updated 4 months ago
- MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks☆8,919Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆26Jun 10, 2025Updated 11 months ago
- ☆39Jun 18, 2025Updated 11 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆140Jun 12, 2024Updated last year
- ☆243Feb 21, 2025Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 7 months ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆33Nov 2, 2025Updated 6 months ago
- The code corresponding to the paper "Improving Sample Efficiency of Deep Reinforcement Learning for Bipedal Walking".☆23Aug 8, 2022Updated 3 years ago