OpenBMB / AgentCPM-GUI
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
☆834 Updated last week
Alternatives and similar repositories for AgentCPM-GUI
Users who are interested in AgentCPM-GUI are comparing it to the repositories listed below.
- 🌐 WebWalker [ACL2025] & WebDancer [Preprint] ☆1,069 Updated 2 weeks ago
- Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users ☆444 Updated 2 months ago
- An open-sourced end-to-end VLM-based GUI Agent ☆973 Updated 2 months ago
- ☆906 Updated this week
- Query and Summarize your chat messages. ☆988 Updated 6 months ago
- ☆489 Updated 4 months ago
- ☆170 Updated last week
- ☆255 Updated 10 months ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization" ☆278 Updated 5 months ago
- A simple agent framework that's capable of browser use + mcp + auto instrument + plan + deep research + more ☆279 Updated 3 weeks ago
- ☆3,142 Updated this week
- 🚀 MCP server for accessing RedNote (XiaoHongShu, xhs). ☆607 Updated last month
- ☆1,463 Updated this week
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents ☆500 Updated 2 weeks ago
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths ☆487 Updated last month
- Build & Optimize your RAG. ☆699 Updated last month
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World ☆265 Updated last month
- Baidu Map MCP Server ☆293 Updated last week
- ☆310 Updated 6 months ago
- ☆750 Updated last week
- GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents ☆256 Updated this week
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining" ☆629 Updated 4 months ago
- AI design agent, local alternative for Lovart. AI agent with ability to design, edit and generate images, posters, storyboards, etc. ☆634 Updated this week
- The world's first Full-Stack Open-Source General AI Agent ☆471 Updated this week
- An LLM-based Web Navigating Agent (KDD'24) ☆867 Updated 8 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆506 Updated 2 weeks ago
- An LLM-based Agent that predicts its tasks proactively. ☆378 Updated last month
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use. ☆1,317 Updated 3 weeks ago
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking ☆454 Updated 2 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆341 Updated 2 months ago