OpenBMB / AgentCPM-GUILinks
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
☆1,081Updated 4 months ago
Alternatives and similar repositories for AgentCPM-GUI
Users that are interested in AgentCPM-GUI are comparing it to the libraries listed below
Sorting:
- Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users☆531Updated 6 months ago
- An open-sourced end-to-end VLM-based GUI Agent☆1,066Updated 6 months ago
- ReMe: Memory Management Framework for Agents - Remember Me, Refine Me.☆545Updated 3 weeks ago
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,509Updated 4 months ago
- ☆285Updated last year
- An LLM-based Web Navigating Agent (KDD'24)☆892Updated last year
- ☆1,033Updated last week
- ☆219Updated this week
- ☆821Updated last month
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"☆639Updated 7 months ago
- [EMNLP 2025 Oral] MemoryOS is designed to provide a memory operating system for personalized AI agents.☆789Updated last month
- ☆76Updated 2 months ago
- MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, Brow…☆741Updated last week
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆578Updated 4 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆292Updated 3 months ago
- A LLM-based Agent that predict its tasks proactively.☆428Updated last month
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆291Updated 4 months ago
- ☆1,790Updated 3 weeks ago
- MineContext is your proactive context-aware AI partner(Context-Engineering+ChatGPT Pulse)☆1,922Updated this week
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆500Updated 4 months ago
- 🚀MCP server for accessing RedNote(XiaoHongShu, xhs).☆881Updated 5 months ago
- 这是一款基于 Playwright 开发的小红书自动搜索和评论工具,作为 MCP Server,可通过特定配置接入 MCP Client(如Claude for Desktop),帮助用户自动完成登录小红书、搜索关键词、获取笔记内容及发布AI生成评论等操作。☆301Updated 3 months ago
- Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving☆372Updated 2 months ago
- ☆337Updated last week
- GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning☆1,710Updated this week
- Query and Summarize your chat messages.☆1,020Updated 10 months ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆286Updated 2 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud(通义点金:阿里云金融大模型)☆366Updated last week
- GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆345Updated 2 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆563Updated 4 months ago