niuzaisheng/ScreenExplorer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/niuzaisheng/ScreenExplorer)

niuzaisheng / ScreenExplorer

ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World

☆26

Alternatives and similar repositories for ScreenExplorer

Users that are interested in ScreenExplorer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RLHFlow / GVM
View on GitHub
☆16Jul 29, 2025Updated 11 months ago
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated 11 months ago
ai-agents-2030 / DistRL-open
View on GitHub
☆22May 23, 2025Updated last year
OpenGVLab / GUI-Odyssey
View on GitHub
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159Jan 3, 2026Updated 6 months ago
Dongping-Chen / GUI-World
View on GitHub
(ICLR 2025) The Official Code Repository for GUI-World.
☆69Dec 18, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
OpenGVLab / ZeroGUI
View on GitHub
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆119Jul 17, 2025Updated last year
aoxy / ClassIn-Video-Download
View on GitHub
ClassIn回放视频批量下载
☆12Jun 4, 2020Updated 6 years ago
chuyg1005 / seeclick-crawler
View on GitHub
☆20Apr 24, 2024Updated 2 years ago
niuzaisheng / ScreenAgent
View on GitHub
ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)
☆607Nov 25, 2024Updated last year
YuxiangChai / AMEX-codebase
View on GitHub
☆33Sep 27, 2024Updated last year
YXB-NKU / SE-GUI
View on GitHub
[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"
☆108Oct 21, 2025Updated 9 months ago
MobileLLM / LLM-Explorer
View on GitHub
☆25Jun 1, 2026Updated last month
OSU-NLP-Group / UGround
View on GitHub
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆315Mar 11, 2026Updated 4 months ago
ant-research / M2-Miner
View on GitHub
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55Apr 22, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Yuqi-Zhou / GUI-G1
View on GitHub
☆28Sep 15, 2025Updated 10 months ago
hiaoxui / nugget
View on GitHub
☆11Aug 1, 2024Updated last year
TongUI-agent / TongUI-agent
View on GitHub
[AAAI 2026]Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General…
☆114Dec 1, 2025Updated 7 months ago
jylee425 / b-moca
View on GitHub
Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025)
☆34Jul 21, 2025Updated last year
xlang-ai / VideoAgentTrek
View on GitHub
The official repo of VideoAgentTrek
☆57Oct 24, 2025Updated 9 months ago
DigiRL-agent / digiq
View on GitHub
☆121Apr 8, 2025Updated last year
agentsea / osuniverse
View on GitHub
Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agents
☆24May 7, 2025Updated last year
njucckevin / SeeClick
View on GitHub
The model, data and code for the visual GUI Agent SeeClick
☆493Jul 13, 2025Updated last year
ZJU-ACES-ISE / ChatUITest
View on GitHub
Under construction
☆14Jan 15, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
VeriGUI-Team / VeriWeb
View on GitHub
VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking
☆88Jan 21, 2026Updated 6 months ago
spyysalo / s800
View on GitHub
Tools for working with the S800 corpus
☆12Sep 17, 2020Updated 5 years ago
cvenhoff / vlm-mapping
View on GitHub
☆19Jun 20, 2025Updated last year
gen-robot / StreamingVLA
View on GitHub
Official repo for "StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation"
☆29Jun 29, 2026Updated 3 weeks ago
SceneDroid / SceneDroid
View on GitHub
☆17Oct 30, 2023Updated 2 years ago
oriyor / assistantbench
View on GitHub
Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"
☆71Dec 9, 2024Updated last year
aburns4 / textualforesight
View on GitHub
☆12Aug 8, 2024Updated last year
kschweig / OfflineRL
View on GitHub
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
☆26Jan 16, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
alecwangcq / f-divergence-dpo
View on GitHub
Direct preference optimization with f-divergences.
☆17Nov 3, 2024Updated last year
yanxue7 / E3T-Overcooked
View on GitHub
☆15May 4, 2024Updated 2 years ago
Ruiyang-061X / Awesome-MLLM-Reasoning
View on GitHub
📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.
☆13Feb 7, 2025Updated last year
deerishi / graph-based-semi-supervised-learning
View on GitHub
This project explores the different techniques (both scalable and non scalable) for Graph based semi supervised learning. Recent techniqu…
☆14May 28, 2016Updated 10 years ago
WukLab / osworld-human
View on GitHub
OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents
☆27May 17, 2026Updated 2 months ago
NKU-HLT / MusicEval-baseline
View on GitHub
☆12Apr 18, 2025Updated last year
MinsungHyun / Class-Imbalanced-Semi-Supervised-Learning
View on GitHub
Class-Imbalanced Semi-Supervised Learning code
☆11Aug 18, 2021Updated 4 years ago