xlang-ai / OpenCUA
OpenCUA: Open Foundations for Computer-Use Agents
☆554Updated 3 weeks ago
Alternatives and similar repositories for OpenCUA
Users who are interested in OpenCUA are comparing it to the repositories listed below.
- AgentFlow: In-the-Flow Agentic System Optimization☆1,187Updated last week
- ☆1,082Updated 3 weeks ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆589Updated 5 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆773Updated 3 months ago
- 🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets☆657Updated last week
- Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆667Updated last week
- [NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆351Updated 2 weeks ago
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.☆456Updated last month
- MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, Brow…☆813Updated this week
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆367Updated 8 months ago
- Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.☆368Updated last week
- A Scientific Multimodal Foundation Model☆604Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆467Updated 2 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆450Updated last week
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆490Updated last month
- ☆843Updated 2 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆397Updated 6 months ago
- open-source coding LLM for software engineering tasks☆1,035Updated last month
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆704Updated last month
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.☆453Updated this week
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆280Updated 3 weeks ago
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆377Updated 4 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆485Updated 2 months ago
- LightMem: Lightweight and Efficient Memory-Augmented Generation☆347Updated this week
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆229Updated last week
- GUI Grounding for Professional High-Resolution Computer Use☆277Updated 2 weeks ago
- An open-source implementation of the "Agentic Context Engineering (ACE)" method from *Agentic Context Engineering: Evolving Contexts for Se…☆290Updated this week
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆284Updated 3 months ago
- ☆834Updated last month
- 👩‍⚖️ Coding Agent-as-a-Judge☆662Updated 5 months ago