xlang-ai / OpenCUA
OpenCUA: Open Foundations for Computer-Use Agents
☆458 Updated last week
Alternatives and similar repositories for OpenCUA
Users who are interested in OpenCUA are comparing it to the repositories listed below.
- ☆867 Updated last week
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction ☆354 Updated 6 months ago
- GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents ☆331 Updated last month
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004) ☆601 Updated this week
- ☆800 Updated last week
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents ☆376 Updated 4 months ago
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects. ☆433 Updated 3 months ago
- Build, manage, and scale your AI agents with ease. ☆451 Updated last week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow. ☆649 Updated last month
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents ☆273 Updated last month
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/ ☆369 Updated 2 months ago
- The evaluation benchmark on MCP servers ☆198 Updated last week
- ☆387 Updated this week
- ⚖️ The First Coding Agent-as-a-Judge ☆619 Updated 3 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆384 Updated 2 weeks ago
- GUI Grounding for Professional High-Resolution Computer Use ☆251 Updated last month
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆549 Updated 3 months ago
- Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents. ☆764 Updated 4 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆251 Updated 2 weeks ago
- Code and data for the Chain-of-Draft (CoD) paper ☆325 Updated 6 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆585 Updated 2 months ago
- 🚀 MassGen: An Open-source Multi-Agent Scaling System Inspired by Grok Heavy and Gemini Deep Think. Join the discord channel: https://dis… ☆419 Updated this week
- A Scientific Multimodal Foundation Model ☆561 Updated last week
- Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs ☆1,230 Updated this week
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414 ☆333 Updated 2 weeks ago
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers. ☆409 Updated last week
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents ☆218 Updated 2 months ago
- [Up-to-date] Awesome Agentic Deep Research Resources ☆437 Updated 2 weeks ago
- Agentic Web: Weaving the Next Web with AI Agents. ☆342 Updated 2 weeks ago
- ☆66 Updated 3 months ago