THUDM / Android-LabLinks

☆227

Alternatives and similar repositories for Android-Lab

Users that are interested in Android-Lab are comparing it to the libraries listed below

Sorting:

THUDM / VisualAgentBench
Towards Large Multimodal Models as Visual Foundation Agents
☆225Updated 3 months ago
aialt / awesome-mobile-agents
✨✨Latest Papers and Datasets on Mobile and PC GUI Agent
☆129Updated 8 months ago
THUDM / WebRL
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆430Updated last month
xlang-ai / aguvis
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
☆335Updated 4 months ago
OpenGVLab / GUI-Odyssey
GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes fr…
☆123Updated 8 months ago
OSU-NLP-Group / UGround
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆262Updated 2 weeks ago
njucckevin / SeeClick
The model, data and code for the visual GUI Agent SeeClick
☆406Updated 3 weeks ago
vyokky / LLM-Brained-GUI-Agents-Survey
GitHub page for "Large Language Model-Brained GUI Agents: A Survey"
☆178Updated last month
OS-Copilot / OS-Genesis
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
☆149Updated 3 weeks ago
lll6gg / UI-R1
Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
☆122Updated 2 months ago
OpenBMB / Eurus
☆320Updated 10 months ago
IMNearth / CoAT
Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)
☆91Updated 9 months ago
DigiRL-agent / digirl
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆372Updated 5 months ago
SuperGPQA / SuperGPQA
☆157Updated 3 months ago
X-PLUG / Multi-LLM-Agent
☆229Updated last year
OSU-NLP-Group / GUI-Agents-Paper-List
Building a comprehensive and handy list of papers for GUI agents
☆442Updated last month
cooelf / Auto-GUI
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
☆244Updated last year
GAIR-NLP / PC-Agent
PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World
☆274Updated 2 months ago
google-research / android_world
AndroidWorld is an environment and benchmark for autonomous agents
☆367Updated this week
InternLM / POLAR
Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.
☆140Updated 3 weeks ago
ADaM-BJTU / O1-CODER
AN O1 REPLICATION FOR CODING
☆335Updated 7 months ago
qiancheng0 / ToolRL
☆293Updated last month
ADaM-BJTU / AutoCoA
AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…
☆121Updated 4 months ago
RUCBM / GUICourse
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆123Updated last year
QwenLM / AutoIF
☆298Updated last year
Gen-Verse / ReasonFlux
ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling
☆462Updated 2 weeks ago
likaixin2000 / ScreenSpot-Pro-GUI-Grounding
GUI Grounding for Professional High-Resolution Computer Use
☆238Updated 3 weeks ago
open-compass / T-Eval
[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
☆283Updated last year
RUC-GSAI / YuLan-Mini
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
☆200Updated last week
Tongyi-Zhiwen / QwenLong-L1
☆287Updated 2 months ago