Princeton-AI2-Lab/ZoomClick

A Practical Zoom-in GUI Grounding and Behavior-Based Evaluation method.

☆25

Alternatives and similar repositories for ZoomClick

Users who are interested in ZoomClick are comparing it to the libraries listed below.

ZJUSCL / MVP
Multi-View prediction enhances GUI Grounding
☆21 · Feb 22, 2026 · Updated 5 months ago
wwfnb / Laser
☆16 · Sep 16, 2025 · Updated 10 months ago
Yoonkyo / TraceForge
Official code for "TraceGen: World Modeling in 3D Trace-Space Enables Learning from Cross-Embodiment Videos" (CVPR 2026)
☆18 · Jan 31, 2026 · Updated 5 months ago
vivo / DiMo-GUI
[EMNLP 2025] Repository for paper "DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning"
☆30 · Jul 2, 2025 · Updated last year
AIR-DISCOVER / SCP-Diff-Toolkit
(ECCV'24) Official Implementation of SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.
☆15 · Oct 2, 2024 · Updated last year
tiangeluo / RegionFocus
A simple visual test-time scaling method for GUI agent grounding
☆26 · Dec 7, 2025 · Updated 7 months ago
pzs19 / LEMMA
☆16 · Sep 4, 2025 · Updated 10 months ago
synvo-ai / HippoCamp
A benchmark for evaluating contextual agents on realistic multimodal personal-computer environments with profiling and factual-retention …
☆29 · Apr 2, 2026 · Updated 3 months ago
manipulate-in-dream / MinD
☆19 · Sep 4, 2025 · Updated 10 months ago
yfqiu-nlp / swirl
Materials for paper "Self-improving World Modelling with Latent Actions"
☆20 · Feb 5, 2026 · Updated 5 months ago
YuHengsss / Q-Zoom
☆15 · Apr 15, 2026 · Updated 3 months ago
VideoVLA-Project / VideoVLA
[NeurIPS2025] VideoVLA: Video Generators Can Be Generalizable Robot Manipulators
☆32 · Jun 26, 2026 · Updated last month
kid-yang233 / robots
The homework of robos learning base.
☆11 · May 23, 2023 · Updated 3 years ago
D2I-ai / Route
ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)
☆16 · May 15, 2025 · Updated last year
dkoleber / TerrarAI
A platform for reinforcement learning in Terraria
☆12 · Nov 20, 2019 · Updated 6 years ago
exporl / vlaai
Decoding of the speech envelope from EEG using the VLAAI deep neural network
☆14 · Sep 28, 2022 · Updated 3 years ago
IDEA-Research / V-Reflection
Related code, checkpoints and project page for V-Reflection
☆60 · Apr 7, 2026 · Updated 3 months ago
xavier-yu114 / Zoom-Refine
Zoom-Refine: Boosting High-Resolution Multimodal Understanding via Localized Zoom and Self-Refinement
☆19 · Jul 4, 2026 · Updated 3 weeks ago
zs1314 / Fraesormer
【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"
☆13 · Mar 21, 2025 · Updated last year
AMD-AGI / DUET-VLM
DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference
☆25 · May 21, 2026 · Updated 2 months ago
Trent-Fellbootman / dev000
A complete introductory course to programming, computer systems and software development (continuously updating).
☆12 · Feb 21, 2024 · Updated 2 years ago
wenyi-li / FairDiff
[MICCAI 24] The official code repository for paper "FairDiff: Fair Segmentation with Point-Image Diffusion".
☆60 · Mar 12, 2025 · Updated last year
r-three / AttriBoT
Code for AttriBoT from "AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution"
☆15 · Apr 21, 2025 · Updated last year
mvp-ai-lab / FreeScale
The official implementation of our CVPR 2026 paper: "FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation"
☆20 · May 17, 2026 · Updated 2 months ago
om-ai-lab / ZoomEye
[EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
☆91 · Nov 20, 2025 · Updated 8 months ago
supersupercong / MSGNN
[IJCAI-24] Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks
☆11 · Sep 2, 2024 · Updated last year
Kroangine-Xia / Design-of-a-Gesture-Recognition-based-Robotic-Arm-Control-System
☆15 · Jun 19, 2024 · Updated 2 years ago
FeiSun / LaTeX-Drawing
LaTeX Drawing
☆18 · Dec 22, 2025 · Updated 7 months ago
zyzkevin / dyva-worldlm
☆23 · Nov 18, 2025 · Updated 8 months ago
Han1018 / ZonUI-3B
[WACV 2026] ZonUI-3B: A lightweight, resolution-aware GUI grounding model trained with only 24K samples on a single RTX 4090.
☆26 · Jan 2, 2026 · Updated 6 months ago
likaixin2000 / ScreenSpot-Pro-GUI-Grounding
GUI Grounding for Professional High-Resolution Computer Use
☆383 · Jun 17, 2026 · Updated last month
automl / is_mamba_capable_of_icl
☆18 · Apr 24, 2024 · Updated 2 years ago
BigTaige / MP-GUI
CVPR25
☆28 · Jul 2, 2025 · Updated last year
Phil26AT / BlenderNoriPlugin
Export blender scenes to the Nori educational raytracer. Proposed and used in the Computer Graphics course at ETH Zurich, Fall 2020
☆17 · Oct 25, 2021 · Updated 4 years ago
Murf-y / Attractors-Simulation
Multiple Attractors simulation with customization
☆14 · Feb 22, 2026 · Updated 5 months ago
Shanka123 / MAP
☆33 · Sep 20, 2025 · Updated 10 months ago
wow-world-model / wow-world-model
WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine…
☆165 · Jan 4, 2026 · Updated 6 months ago
THUDM / paper-source-trace
☆19 · Sep 29, 2024 · Updated last year
codepassionor / Tokenflow_adapter
Expert Systems with Applications (ESWA - JCR Q1, SCI Q1)
☆13 · Jul 19, 2025 · Updated last year