xlang-ai / OSWorld-GLinks
Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆49Updated this week
Alternatives and similar repositories for OSWorld-G
Users that are interested in OSWorld-G are comparing it to the libraries listed below
Sorting:
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆68Updated this week
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆51Updated 5 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆82Updated last week
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆57Updated 7 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆59Updated last month
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆101Updated 2 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆43Updated 3 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆45Updated last month
- General Reasoner: Advancing LLM Reasoning Across All Domains☆117Updated this week
- An Illusion of Progress? Assessing the Current State of Web Agents☆52Updated last week
- Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆94Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆60Updated 2 weeks ago
- ☆38Updated 5 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆136Updated 6 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆101Updated 2 months ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆74Updated last month
- official implementation of paper "Process Reward Model with Q-value Rankings"☆59Updated 3 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆98Updated 3 weeks ago
- ☆79Updated 3 weeks ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆55Updated 3 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆36Updated last week
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆56Updated 2 months ago
- ☆67Updated 2 months ago
- ☆46Updated 2 months ago
- Efficient Agent Training for Computer Use☆85Updated last week
- ☆92Updated 8 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆207Updated 3 weeks ago
- ☆145Updated last week
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆102Updated 4 months ago
- Official Repository of LatentSeek☆30Updated last week