☆35Jun 20, 2024Updated 2 years ago
Alternatives and similar repositories for CoCo-Agent
Users that are interested in CoCo-Agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2025] Research code for the paper "OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents"☆21Jun 19, 2025Updated last year
- ☆12Aug 8, 2024Updated last year
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 11 months ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆261Jul 16, 2024Updated last year
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆103Oct 14, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆23Oct 11, 2024Updated last year
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated last year
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆159Jan 3, 2026Updated 6 months ago
- ☆35Jan 12, 2026Updated 5 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆143Mar 1, 2026Updated 4 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆69Jan 7, 2026Updated 5 months ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆393Feb 22, 2025Updated last year
- The model, data and code for the visual GUI Agent SeeClick☆486Jul 13, 2025Updated 11 months ago
- ☆33Sep 27, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆70Aug 9, 2024Updated last year
- All code and data necessary to replicate experiments in the paper BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Model…☆13Sep 16, 2024Updated last year
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆64Jul 11, 2025Updated 11 months ago
- [AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"☆157Nov 24, 2025Updated 7 months ago
- Quick access to your Zotero references from the system tray☆12Aug 1, 2011Updated 14 years ago
- Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations b…☆36Jun 27, 2024Updated 2 years ago
- gradio bbox labeling tools☆11May 12, 2023Updated 3 years ago
- [NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents☆58Nov 27, 2025Updated 7 months ago
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 使用Transformer进行中英翻译(demo)☆17Aug 25, 2023Updated 2 years ago
- Game UI Glitch Detection via Bug Understanding☆12Jul 31, 2021Updated 4 years ago
- ☆17Feb 26, 2024Updated 2 years ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆106Jan 11, 2026Updated 5 months ago
- Visual and Embodied Concepts evaluation benchmark☆21Oct 10, 2023Updated 2 years ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆317Mar 11, 2026Updated 3 months ago
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"☆66Dec 4, 2025Updated 7 months ago
- GPT-4V in Wonderland: LMMs as Smartphone Agents☆134Jul 17, 2024Updated last year
- [CVPR 2026] HiconAgent: History Context-aware Policy Optimization for GUI Agents☆30Mar 9, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python codes for mathematical modeling.☆13Sep 5, 2021Updated 4 years ago
- ☆35Jan 28, 2026Updated 5 months ago
- A common cursor icon type☆18Dec 14, 2025Updated 6 months ago
- ☆37Sep 30, 2024Updated last year
- Activity Grammars for Temporal Action Segmentation (NeurIPS 2023)☆14Jun 14, 2024Updated 2 years ago
- AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management☆31Apr 10, 2026Updated 2 months ago
- ☆15Feb 27, 2024Updated 2 years ago