ServiceNow/GroundCUA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ServiceNow/GroundCUA)

ServiceNow / GroundCUA

GroundCUA

☆129

Alternatives and similar repositories for GroundCUA

Users that are interested in GroundCUA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / Pitfalls-of-Memorization
View on GitHub
Understanding the interplay between memorization and generalization in neural networks, featuring MAT, a learning algorithm to enhance ro…
☆40Dec 19, 2024Updated last year
uivision / UI-Vision
View on GitHub
☆33Jul 3, 2025Updated last year
taco-group / Pulse-of-Motion
View on GitHub
The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics
☆71Mar 26, 2026Updated 3 months ago
xlang-ai / VideoAgentTrek
View on GitHub
The official repo of VideoAgentTrek
☆57Oct 24, 2025Updated 8 months ago
The-AI-Alliance / cube-standard
View on GitHub
Standardize benchmark wrapping so the community can wrap various otherwise-incompatible benchmarks uniformly and use them everywhere.
☆52Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
xlang-ai / OSWorld-V2
View on GitHub
OSWorld 2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks
☆196Jul 9, 2026Updated last week
xlang-ai / OSWorld-G
View on GitHub
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆172Jun 18, 2026Updated last month
McGill-NLP / safearena
View on GitHub
SafeArena is a benchmark for assessing the harmful capabilities of web agents
☆24Apr 23, 2025Updated last year
alibaba-damo-academy / Lumos-Custom
View on GitHub
[ICLR-26, ECCV-26, NeurIPS-25] Lumos-Custom Project: research for customized video generation in the Lumos Project.
☆216Jun 29, 2026Updated 3 weeks ago
ysy31415 / EffectMaker
View on GitHub
Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
☆42Mar 6, 2026Updated 4 months ago
Coral79 / ActionPlan-Code
View on GitHub
[Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning
☆91Jul 6, 2026Updated 2 weeks ago
Yan98 / GTA1
View on GitHub
☆130Oct 3, 2025Updated 9 months ago
showlab / showui-pi
View on GitHub
[CVPR 2026] ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands
☆128Apr 22, 2026Updated 2 months ago
McGill-NLP / AdversarialTriggers
View on GitHub
TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models
☆19Aug 17, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
McGill-NLP / retriever-lm-reasoning
View on GitHub
Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…
☆28Nov 2, 2023Updated 2 years ago
cvg / megaflow
View on GitHub
MegaFlow: Zero-Shot Large Displacement Optical Flow
☆140Mar 28, 2026Updated 3 months ago
ServiceNow / drbench
View on GitHub
An enterprise deep research benchmark
☆40Apr 22, 2026Updated 2 months ago
xlang-ai / CUA-Gym
View on GitHub
Scalable pipeline for synthesizing verifiable RLVR training data for computer-use agents
☆179May 26, 2026Updated last month
tianyu-z / VCR
View on GitHub
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
☆32Feb 26, 2025Updated last year
snowflakewang / CustomX
View on GitHub
[ECCV 2026] CustomX: Unified Character, Action, and Scene Customization in Video World Models
☆96Jun 25, 2026Updated 3 weeks ago
lukasHoel / video_to_world
View on GitHub
Our method reconstructs 3D worlds from video diffusion models using non-rigid alignment to resolve inherent 3D inconsistencies in the gen…
☆273Apr 27, 2026Updated 2 months ago
FanbinLu / STEVE-R1
View on GitHub
R1-like Computer-use Agent
☆91Mar 21, 2025Updated last year
handx-project / HandX
View on GitHub
[CVPR 2026] HandX: Scaling Bimanual Motion and Interaction Generation
☆139Jul 1, 2026Updated 2 weeks ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
facebookresearch / XRM
View on GitHub
Discovering environments with XRM
☆18Dec 6, 2024Updated last year
meituan / EvoCUA
View on GitHub
EvoCUA: Evolving Computer Use Agent
☆332Mar 31, 2026Updated 3 months ago
mo230761 / UniGeo
View on GitHub
A framework for camera-controllable image editing using unified geometric guidance and video models.
☆65Jun 25, 2026Updated 3 weeks ago
showlab / macosworld
View on GitHub
☆35Jan 28, 2026Updated 5 months ago
AIGeeksGroup / UniMesh
View on GitHub
UniMesh: Unifying 3D Mesh Understanding and Generation
☆57Jul 14, 2026Updated last week
OSU-NLP-Group / Explorer
View on GitHub
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
☆29Feb 17, 2026Updated 5 months ago
FudanCVL / PSDesigner
View on GitHub
[CVPR 2026] PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow
☆149Mar 28, 2026Updated 3 months ago
FudanCVL / GlyphPrinter
View on GitHub
[CVPR 2026 Highlight] GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering
☆104Apr 9, 2026Updated 3 months ago
likaixin2000 / MMCode
View on GitHub
[EMNLP 2024] Multi-modal reasoning problems via code generation.
☆28Apr 14, 2026Updated 3 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Tencent / POINTS-GUI
View on GitHub
☆46Feb 9, 2026Updated 5 months ago
showlab / ShowUI-Aloha
View on GitHub
Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops.
☆317Jan 20, 2026Updated 6 months ago
sk-adapter / SK-Adapter
View on GitHub
[ECCV2026] Official repo for paper "SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation".
☆62Jun 26, 2026Updated 3 weeks ago
OSU-NLP-Group / Mind2Web-2
View on GitHub
[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
☆111May 17, 2026Updated 2 months ago
InfiXAI / InfiGUI-G1
View on GitHub
[AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic al…
☆148Nov 19, 2025Updated 8 months ago
WukLab / osworld-human
View on GitHub
OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents
☆27May 17, 2026Updated 2 months ago
open-compass / MMBench-GUI
View on GitHub
Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…
☆112Sep 8, 2025Updated 10 months ago