WukLab/osworld-human

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WukLab/osworld-human)

WukLab / osworld-human

OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents

☆27

Alternatives and similar repositories for osworld-human

Users that are interested in osworld-human are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WukLab / InferCept
View on GitHub
☆34Jun 22, 2024Updated 2 years ago
Yan98 / GTA1
View on GitHub
☆130Oct 3, 2025Updated 9 months ago
agentsea / osuniverse
View on GitHub
Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agents
☆24May 7, 2025Updated last year
bin123apple / InfantAgent
View on GitHub
[NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.
☆39Apr 23, 2026Updated 3 months ago
SALT-NLP / PopupAttack
View on GitHub
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
☆51Dec 23, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
WebChoreArena / WebChoreArena
View on GitHub
COLM2026
☆36Jul 9, 2026Updated 2 weeks ago
xlang-ai / computer-agent-arena
View on GitHub
[ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agents
☆67Feb 26, 2026Updated 4 months ago
yxuansu / Awesome_Diffusions
View on GitHub
☆17Feb 20, 2023Updated 3 years ago
VeriGUI-Team / VeriWeb
View on GitHub
VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking
☆88Jan 21, 2026Updated 6 months ago
showlab / WorldGUI
View on GitHub
Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.
☆124Jul 27, 2025Updated 11 months ago
xlang-ai / VideoAgentTrek
View on GitHub
The official repo of VideoAgentTrek
☆57Oct 24, 2025Updated 9 months ago
ai-agents-2030 / ViMo
View on GitHub
☆26Apr 2, 2026Updated 3 months ago
ServiceNow / WorkArena
View on GitHub
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
☆261Apr 25, 2026Updated 3 months ago
JunShern / few-shot-adaptation
View on GitHub
Exploring Few-Shot Adaptation of Language Models with Tables
☆25Aug 22, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
SJTU-IPADS / fgnn-artifacts
View on GitHub
FGNN's artifact evaluation (EuroSys 2022)
☆18Apr 25, 2022Updated 4 years ago
web-arena-x / visualwebarena
View on GitHub
VisualWebArena is a benchmark for multimodal agents.
☆484Nov 9, 2024Updated last year
WenyiWU0111 / CoMEM-Agent
View on GitHub
Official repository for paper Auto-scaling Continuous Memory for GUI Agent
☆29Feb 2, 2026Updated 5 months ago
janekm / retrieval_comparison
View on GitHub
☆20Jun 6, 2025Updated last year
INK-USC / ReCross
View on GitHub
ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation
☆23May 1, 2022Updated 4 years ago
brownirl / rlang
View on GitHub
A Declarative Language for Expressing Partial World Knowledge to Reinforcement Learning Agents
☆17Jan 19, 2024Updated 2 years ago
microsoft / SCoRE
View on GitHub
ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing
☆31Aug 30, 2021Updated 4 years ago
microsoft / WindowsAgentArena
View on GitHub
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
☆882Apr 13, 2026Updated 3 months ago
merrymercy / Awesome-Efficient-LLM
View on GitHub
A curated list for Efficient Large Language Models
☆11Mar 25, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
GATECH-EIC / PipeGCN
View on GitHub
[ICLR 2022] "PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication" by Cheng Wan, Y…
☆34Mar 15, 2023Updated 3 years ago
facebookresearch / mbr-exec
View on GitHub
code for "Natural Language to Code Translation with Execution"
☆41Nov 2, 2022Updated 3 years ago
agentica-project / verl
View on GitHub
☆17Mar 30, 2026Updated 3 months ago
taogoddd / GPT-4V-API
View on GitHub
Self-hosted GPT-4V api
☆27Nov 6, 2023Updated 2 years ago
pixas / DECS
View on GitHub
Official implementation for ICLR 2026 Oral: Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling
☆21Mar 31, 2026Updated 3 months ago
xlang-ai / OSWorld-G
View on GitHub
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆172Jun 18, 2026Updated last month
Khang-9966 / Computer-Browser-Phone-Use-Agent-Datasets
View on GitHub
This repository hosts a collection of datasets for training and evaluating CUA / GUI agents.
☆137Jun 16, 2026Updated last month
ServiceNow / sec
View on GitHub
☆16Jul 10, 2025Updated last year
benediktstroebl / agent-evals
View on GitHub
☆27May 28, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
uivision / UI-Vision
View on GitHub
☆33Jul 3, 2025Updated last year
SalesforceAIResearch / CoAct-1
View on GitHub
CoAct-1: Computer-using Agents with Coding as Actions
☆27Jun 2, 2026Updated last month
bytedance / raylink
View on GitHub
Framework to build and train RL algorithms
☆39Oct 11, 2021Updated 4 years ago
yichuan-w / raytracer
View on GitHub
raytracer
☆10Jul 18, 2022Updated 4 years ago
mail-ecnu / Text-Gym-Agents
View on GitHub
This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate…
☆22May 29, 2024Updated 2 years ago
ServiceNow / GroundCUA
View on GitHub
GroundCUA
☆129Mar 24, 2026Updated 4 months ago
cyd3r / notify-free-gpu
View on GitHub
A telegram bot that sends you a message when the GPU is in use
☆11May 27, 2024Updated 2 years ago