JiuTian-VL/SimpAgent

[ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification

☆48

Alternatives and similar repositories for SimpAgent

Users who are interested in SimpAgent are comparing it to the repositories listed below.

iLearn-Lab / ACM-MM25-PUMA
[ACM MM 2025] PUMA: Layer-Pruned Language Model for Efficient Unified Multimodal Retrieval with Modality-Adaptive Learning
☆18 · Jun 6, 2026 · Updated last month
iLearn-Lab / ACL26-PersonalAlign
[ACL 2026 main] PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records
☆21 · Apr 11, 2026 · Updated 3 months ago
JiuTian-VL / UniEmo
[TIP 2026] UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries
☆34 · May 7, 2026 · Updated 2 months ago
tiangeluo / RegionFocus
A simple visual test-time scaling method for GUI agent grounding
☆26 · Dec 7, 2025 · Updated 7 months ago
iLearn-Lab / MM25-EmoSym
[ACM MM 2025] Official repository of "EmoSym: A Symbiotic Framework for Unified Emotional Understanding and Generation via Latent Reasoni…
☆30 · May 6, 2026 · Updated 2 months ago
WenyiWU0111 / CoMEM-Agent
Official repository for paper Auto-scaling Continuous Memory for GUI Agent
☆29 · Feb 2, 2026 · Updated 5 months ago
iLearn-Lab / CVPR26-ConsisVLA-4D
[CVPR 2026] ConsisVLA-4D: Advancing Spatiotemporal Consistency in Efficient 3D-Perception and 4D-Reasoning for Robotic Manipulation
☆46 · May 8, 2026 · Updated 2 months ago
uivision / UI-Vision
☆33 · Jul 3, 2025 · Updated last year
iLearn-Lab / AAAI26-H-GAR
[AAAI 2026] H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Refinement for Robotic Manipulation
☆32 · Nov 28, 2025 · Updated 7 months ago
UITron-hub / UITron-Speech
☆21 · Jan 22, 2026 · Updated 6 months ago
iLearn-Lab / NeurIPS25-CogVLA
[NeurIPS 2025] CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification
☆185 · Jun 17, 2026 · Updated last month
aburns4 / textualforesight
☆12 · Aug 8, 2024 · Updated last year
UCSB-AI / Screen-Point-and-Read
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
☆31 · May 12, 2026 · Updated 2 months ago
YuxiangChai / AMEX-codebase
☆33 · Sep 27, 2024 · Updated last year
hulianyuyy / iLLaVA
iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR 2026)
☆23 · Jun 24, 2026 · Updated last month
ZJU-REAL / Awesome-GUI-Agents
A curated collection of resources, tools, and frameworks for developing GUI Agents.
☆446 · Jul 9, 2026 · Updated 2 weeks ago
GuangyanS / Sys2-LLaVA
☆31 · Feb 10, 2025 · Updated last year
ai-agents-2030 / ViMo
☆26 · Apr 2, 2026 · Updated 3 months ago
Wuzheng02 / OS-Kairos
[ACL 2025] Research code for the paper "OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents"
☆21 · Jun 19, 2025 · Updated last year
JiuTian-VL / Large-VLM-based-VLA-for-Robotic-Manipulation
A curated list of large VLM-based VLA models for robotic manipulation.
☆427 · Apr 3, 2026 · Updated 3 months ago
iLearn-Lab / AAAI26-SemanticVLA
[AAAI 2026 Oral] SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
☆71 · Apr 5, 2026 · Updated 3 months ago
showlab / Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
☆1,199 · Aug 17, 2025 · Updated 11 months ago
xiaomi-research / guievalkit
[ICML 2026] GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents
☆23 · Feb 26, 2026 · Updated 4 months ago
JiuTian-VL / Optimus-3
Official Implementation for Optimus-3: Dual-Router Aligned Mixture-of-Experts Agent with Dual-Granularity Reasoning-Aware Policy Optimiza…
☆69 · Apr 14, 2026 · Updated 3 months ago
microsoft / GUI-Actor
[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
☆410 · Apr 13, 2026 · Updated 3 months ago
iLearn-Lab / NeurIPS24-Optimus-1
[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
☆102 · Jun 17, 2025 · Updated last year
xbmxb / EnvDistraction
☆24 · Oct 11, 2024 · Updated last year
alibaba / UI-Ins
Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning
☆78 · Apr 20, 2026 · Updated 3 months ago
manipulate-in-dream / MinD
☆19 · Sep 4, 2025 · Updated 10 months ago
MadeAgents / mobile-use
MobileUse: an open-source mobile GUI agent for Android phone automation, AndroidWorld/AndroidLab evaluation, hierarchical reflection, and…
☆168 · Jul 10, 2026 · Updated 2 weeks ago
Euphoria16 / UI-Genie
[NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
☆60 · Nov 27, 2025 · Updated 7 months ago
ZJU-REAL / GUI-G2
[AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding
☆310 · Apr 15, 2026 · Updated 3 months ago
OpenGVLab / GUI-Odyssey
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159 · Jan 3, 2026 · Updated 6 months ago
IMYangJinheng / DeepCFD-for-Prediction-of-flow-field-in-Laval-nozzle
This is a U-Net-based deep learning model, which we call DeepCFD. You can use this model to predict the temperature, velocity, and pressu…
☆15 · Sep 30, 2025 · Updated 9 months ago
pkunlp-icler / MIC
MMICL, a state-of-the-art VLM with in-context learning ability, from ICL, PKU
☆49 · Jul 13, 2025 · Updated last year
showlab / FocusUI
[CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
☆35 · Jun 7, 2026 · Updated last month
InfiXAI / InfiGUI-R1
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆67 · Dec 4, 2025 · Updated 7 months ago
OSU-NLP-Group / GUI-Agents-Paper-List
Awesome GUI Agent Paper List
☆862 · Jun 28, 2026 · Updated 3 weeks ago
penghao-wu / GUI_Reflection
☆34 · Sep 19, 2025 · Updated 10 months ago