OpenBMB / AgentCPM-GUILinks
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
☆1,168Updated 6 months ago
Alternatives and similar repositories for AgentCPM-GUI
Users that are interested in AgentCPM-GUI are comparing it to the libraries listed below
Sorting:
- Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users☆581Updated 8 months ago
- MAI-UI: Real-World Centric Foundation GUI Agents.☆1,296Updated this week
- An open-sourced end-to-end VLM-based GUI Agent☆1,113Updated 9 months ago
- ReMe: Memory Management Kit for Agents - Remember Me, Refine Me.☆825Updated last week
- ☆85Updated 3 weeks ago
- ☆1,180Updated 2 months ago
- ☆263Updated this week
- An LLM-based Web Navigating Agent (KDD'24)☆918Updated last year
- [EMNLP 2025 Oral] MemoryOS is designed to provide a memory operating system for personalized AI agents.☆969Updated 3 months ago
- ☆847Updated 2 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆619Updated 6 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆306Updated 7 months ago
- EverMemOS is an open-source, enterprise-grade intelligent memory system. Our mission is to build AI memory that never forgets, making eve…☆1,469Updated 3 weeks ago
- A minimal yet professional single agent demo project that showcases the core execution pipeline and production-grade features of agents.☆1,080Updated this week
- A LLM-based Agent that predict its tasks proactively.☆466Updated 4 months ago
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"☆650Updated 10 months ago
- ☆294Updated last year
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,636Updated 7 months ago
- [NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆369Updated 2 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,216Updated 4 months ago
- A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine lang…☆1,266Updated 9 months ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆291Updated 5 months ago
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆190Updated last year
- Build, evaluate and train General Multi-Agent Assistance with ease☆1,085Updated this week
- AndroidWorld is an environment and benchmark for autonomous agents☆572Updated last month
- GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning☆2,096Updated 3 weeks ago
- VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automat…☆101Updated 5 months ago
- A complete 7-layer intelligent memory system for AI Agents with multi-modal memory fusion also support context_engineering☆133Updated 6 months ago
- ☆341Updated 2 months ago
- ☆1,959Updated this week