[ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification
☆49Mar 12, 2026Updated 2 months ago
Alternatives and similar repositories for SimpAgent
Users that are interested in SimpAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026] HiconAgent: History Context-aware Policy Optimization for GUI Agents☆29Mar 9, 2026Updated 2 months ago
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆21Feb 26, 2026Updated 2 months ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆30Mar 6, 2026Updated 2 months ago
- ☆34Sep 19, 2025Updated 8 months ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆30May 12, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICLR 2026] WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving☆42Mar 31, 2026Updated last month
- ☆67Sep 6, 2025Updated 8 months ago
- ☆32Sep 27, 2024Updated last year
- Latest Papers, Codes and Datasets on VTG-LLMs.☆90Nov 17, 2025Updated 6 months ago
- ☆21Jun 18, 2025Updated 11 months ago
- 包含作业代码及代码分析☆10Aug 13, 2021Updated 4 years ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆245May 5, 2025Updated last year
- finetune your florence2 model easy☆21Jul 27, 2024Updated last year
- ☆52Jul 6, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"☆65Dec 4, 2025Updated 5 months ago
- [NeurIPS 2025] CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification☆172Dec 10, 2025Updated 5 months ago
- CaMML:Context-Aware MultiModal Learner for Large Models (ACL 2024 SAC Award)☆15May 21, 2025Updated last year
- ☆17Oct 30, 2023Updated 2 years ago
- [CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy☆25Jun 17, 2025Updated 11 months ago
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization(IJCAI-22)☆12Oct 11, 2022Updated 3 years ago
- Union-set Multi-source Model Adaptation for Semantic Segmentation☆12Oct 24, 2022Updated 3 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning"☆18Mar 15, 2021Updated 5 years ago
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost☆119Jul 17, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Environments, tools, and benchmarks for general computer agents☆15Dec 3, 2024Updated last year
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆35Jul 18, 2025Updated 10 months ago
- https://algo.weixin.qq.com/☆14Mar 7, 2023Updated 3 years ago
- [NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"☆103Oct 21, 2025Updated 7 months ago
- A simple visual test-time scaling method for GUI agent grounding☆25Dec 7, 2025Updated 5 months ago
- ☆16May 30, 2025Updated 11 months ago
- DomainPlus: Cross-Transform Domain Learning towards High Dynamic Range Imaging☆12Oct 11, 2022Updated 3 years ago
- Nex General Agentic Data Pipeline, an end-to-end pipeline for generating high-quality agentic training data.☆33Nov 19, 2025Updated 6 months ago
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆20Nov 24, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Self-Supervised Dataset Distillation for Transfer Learning☆18Apr 10, 2024Updated 2 years ago
- DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents☆24Aug 4, 2025Updated 9 months ago
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 11 months ago
- An official implementation for MS-DETR in ACL'23☆17Jun 3, 2023Updated 2 years ago
- Parallel_Computer_Architecture经典书籍☆17May 13, 2022Updated 4 years ago
- [IEEE TVCG 2025] Self-supervised Learning of Event-guided Video Frame Interpolation for Rolling Shutter Frames☆11Jun 1, 2025Updated 11 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆17Oct 20, 2025Updated 7 months ago