OpenBMB / AgentCPM-GUILinks
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
☆759Updated this week
Alternatives and similar repositories for AgentCPM-GUI
Users that are interested in AgentCPM-GUI are comparing it to the libraries listed below
Sorting:
- Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users☆406Updated last month
- An open-sourced end-to-end VLM-based GUI Agent☆960Updated 2 months ago
- 🌐 WebWalker [ACL2025] & WebDancer [Preprint]☆843Updated last week
- ☆743Updated this week
- ☆157Updated last week
- A simple agent framework that's capable of browser use + mcp + auto instrument + plan + deep research + more☆250Updated last week
- ☆478Updated 3 months ago
- ☆247Updated 9 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆257Updated 2 weeks ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆274Updated 4 months ago
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆479Updated 2 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆475Updated 3 weeks ago
- A General-Purpose AI Agent ✨☆348Updated last week
- ☆65Updated 7 months ago
- A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine lang…☆852Updated 2 months ago
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,278Updated last week
- Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video ge…☆527Updated 3 weeks ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆329Updated last month
- Query and Summarize your chat messages.☆972Updated 6 months ago
- ☆3,010Updated this week
- ☆230Updated 3 months ago
- Build & Optimize your RAG.☆671Updated 3 weeks ago
- A LLM-based Agent that predict its tasks proactively.☆370Updated 2 weeks ago
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents☆362Updated 3 months ago
- ☆1,321Updated 2 weeks ago
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆483Updated 2 weeks ago
- ☆884Updated 2 months ago
- 🤖 A visualization Model Context Protocol server for generating visual charts using @antvis.☆668Updated last week
- ☆458Updated 2 months ago
- AndroidWorld is an environment and benchmark for autonomous agents☆324Updated this week