showlab/computer_use_ootb

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/showlab/computer_use_ootb)

showlab / computer_use_ootb

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

☆1,956

Alternatives and similar repositories for computer_use_ootb

Users that are interested in computer_use_ootb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

showlab / ShowUI
View on GitHub
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
☆1,886Apr 24, 2026Updated 2 months ago
ranpox / awesome-computer-use
View on GitHub
This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.
☆572Apr 15, 2026Updated 3 months ago
showlab / Awesome-GUI-Agent
View on GitHub
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,198Aug 17, 2025Updated 11 months ago
showlab / WorldGUI
View on GitHub
Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.
☆124Jul 27, 2025Updated 11 months ago
OS-Copilot / OS-Atlas
View on GitHub
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
☆452Apr 20, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
e2b-dev / open-computer-use
View on GitHub
AI computer use powered by open source LLMs and E2B Desktop Sandbox
☆2,162Jul 9, 2026Updated 2 weeks ago
microsoft / WindowsAgentArena
View on GitHub
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
☆881Apr 13, 2026Updated 3 months ago
deedy / mac_computer_use
View on GitHub
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
☆881Dec 16, 2024Updated last year
suitedaces / computer-agent
View on GitHub
Desktop app to control your computer with AI using your terminal, browser, mouse & keyboard
☆672Updated this week
njucckevin / SeeClick
View on GitHub
The model, data and code for the visual GUI Agent SeeClick
☆493Jul 13, 2025Updated last year
AmberSahdev / Open-Interface
View on GitHub
Control Any Computer Using LLMs.
☆2,699Jul 17, 2026Updated last week
xlang-ai / OSWorld
View on GitHub
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
☆3,033Updated this week
microsoft / OmniParser
View on GitHub
A simple screen parsing tool towards pure vision based GUI agent
☆25,188Updated this week
corbt / agent.exe
View on GitHub
☆3,478Nov 15, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
OSU-NLP-Group / UGround
View on GitHub
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆315Mar 11, 2026Updated 4 months ago
bytedance / UI-TARS
View on GitHub
Pioneering Automated GUI Interaction with Native Agents
☆11,218Jan 27, 2026Updated 5 months ago
AriaUI / Aria-UI
View on GitHub
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
☆406Feb 8, 2025Updated last year
simular-ai / Agent-S
View on GitHub
Agent S: an open agentic framework that uses computers like a human
☆12,055May 13, 2026Updated 2 months ago
OS-Copilot / OS-Copilot
View on GitHub
An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
☆1,786Sep 9, 2024Updated last year
zai-org / CogAgent
View on GitHub
An open-sourced end-to-end VLM-based GUI Agent
☆1,189Apr 4, 2025Updated last year
xlang-ai / aguvis
View on GitHub
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
☆389Mar 7, 2025Updated last year
trycua / acu
View on GitHub
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
☆1,717Sep 26, 2025Updated 9 months ago
OSU-NLP-Group / SeeAct
View on GitHub
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…
☆851Feb 3, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
THUDM / WebRL
View on GitHub
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆535Jun 6, 2025Updated last year
browser-use / browser-use
View on GitHub
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
☆106,313Updated this week
Upsonic / Upsonic
View on GitHub
Build autonomous AI agents in Python.
☆7,918Jun 18, 2026Updated last month
showlab / Impossible-Videos
View on GitHub
ICML 2025 - Impossible Videos
☆81Jul 23, 2025Updated last year
X-PLUG / MobileAgent
View on GitHub
Mobile-Agent: The Powerful GUI Agent Family
☆8,975Jul 7, 2026Updated 2 weeks ago
openai / swarm
View on GitHub
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
☆21,859Apr 15, 2026Updated 3 months ago
OthersideAI / self-operating-computer
View on GitHub
A framework to enable a multimodal model to operate a computer.
☆10,250Sep 19, 2025Updated 10 months ago
bytedance / UI-TARS-desktop
View on GitHub
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
☆38,203Jul 1, 2026Updated 3 weeks ago
showlab / ROICtrl
View on GitHub
Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation
☆110Apr 16, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
anthropics / claude-quickstarts
View on GitHub
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
☆17,296Updated this week
DigiRL-agent / digirl
View on GitHub
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆393Feb 22, 2025Updated last year
browser-use / web-ui
View on GitHub
🖥️ Run AI Agent in your browser.
☆16,239May 15, 2026Updated 2 months ago
BAAI-Agents / Cradle
View on GitHub
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…
☆2,558Nov 7, 2024Updated last year
showlab / EvolveDirector
View on GitHub
[NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
☆52Oct 14, 2024Updated last year
yuruotong1 / autoMate
View on GitHub
Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural l…
☆3,937Apr 30, 2026Updated 2 months ago
camel-ai / owl
View on GitHub
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
☆20,060Updated this week