OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents
☆21Jan 6, 2026Updated 2 months ago
Alternatives and similar repositories for osworld-human
Users who are interested in osworld-human are comparing it to the libraries listed below
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆35Feb 25, 2026Updated last week
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆26Feb 17, 2026Updated 2 weeks ago
- ☆17Feb 20, 2023Updated 3 years ago
- Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agents☆24May 7, 2025Updated 9 months ago
- ☆31Jul 3, 2025Updated 8 months ago
- ☆32Aug 17, 2025Updated 6 months ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing☆31Aug 30, 2021Updated 4 years ago
- Self-hosted GPT-4V api☆27Nov 6, 2023Updated 2 years ago
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- raytracer☆10Jul 18, 2022Updated 3 years ago
- Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"☆35May 26, 2024Updated last year
- ☆41Jul 21, 2024Updated last year
- A benchmark of Python Library Migration☆14Apr 5, 2025Updated 11 months ago
- ☆13Oct 19, 2023Updated 2 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆19Jan 6, 2026Updated 2 months ago
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- ☆10Jul 13, 2024Updated last year
- Our repo contains an efficient RGB-D feature extractor for category-level and instance-level 6D pose estimation.☆14Oct 29, 2025Updated 4 months ago
- Anchored Diffusion Language Model (NeurIPS 2025)☆27Oct 13, 2025Updated 4 months ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- ☆13Aug 4, 2025Updated 7 months ago
- ☆10May 20, 2019Updated 6 years ago
- [2023 CoRL] Leveraging 3D Reconstruction for Mechanical Search on Cluttered Shelves☆11Dec 12, 2024Updated last year
- Initial commit☆12Aug 14, 2023Updated 2 years ago
- Lucene open-domain QA retrieval in python☆11Feb 18, 2021Updated 5 years ago
- A CW20 token sale template dApp.☆10Sep 29, 2021Updated 4 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- A DFS-based maze generator and solver.☆10Feb 14, 2018Updated 8 years ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Jan 16, 2024Updated 2 years ago
- ☆31Sep 19, 2025Updated 5 months ago
- Real-time terminal dashboard for Polymarket BTC 15-min UP/DOWN prediction markets. Aggregates Chainlink oracle, Binance price feeds, and …☆23Jan 31, 2026Updated last month
- The public reproducible analysis code used for the gaze project☆11Feb 21, 2026Updated last week
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆17Jan 5, 2026Updated 2 months ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- a simple variational auto encoder with some exploration☆12Nov 22, 2024Updated last year
- distill large scale web page text☆12Jul 29, 2023Updated 2 years ago