xlang-ai / OpenCUALinks
OpenCUA: Open Foundations for Computer-Use Agents
☆488Updated last week
Alternatives and similar repositories for OpenCUA
Users that are interested in OpenCUA are comparing it to the libraries listed below
Sorting:
- ☆966Updated last week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆692Updated 2 months ago
- ☆810Updated last month
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆552Updated 3 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆361Updated 6 months ago
- GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆344Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆409Updated 3 weeks ago
- MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, Brow…☆644Updated this week
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆635Updated 2 weeks ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆440Updated this week
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.☆387Updated this week
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.☆442Updated last month
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆352Updated last month
- All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.☆141Updated last week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆366Updated last week
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆588Updated 3 months ago
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.☆444Updated 3 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆385Updated 5 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆262Updated last month
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆376Updated 2 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆432Updated 3 weeks ago
- open-source coding LLM for software engineering tasks☆955Updated 3 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆343Updated 3 months ago
- ⚖️ The First Coding Agent-as-a-Judge☆637Updated 4 months ago
- [Up-to-date] Awesome Agentic Deep Research Resources☆475Updated last month
- ☆816Updated 2 weeks ago
- A Scientific Multimodal Foundation Model☆574Updated last month
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆278Updated 2 months ago
- ☆202Updated 2 weeks ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆186Updated 2 weeks ago