[AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615
☆65Nov 8, 2025Updated 7 months ago
Alternatives and similar repositories for GUI-RCPO
Users that are interested in GUI-RCPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆32Aug 11, 2025Updated 10 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆48Oct 20, 2025Updated 7 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆54Nov 4, 2025Updated 7 months ago
- Official code for "KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation"☆72Updated this week
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆55May 5, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆28Aug 19, 2025Updated 9 months ago
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆323May 20, 2026Updated 3 weeks ago
- On Policy Distillation Build on top of Verl☆76May 25, 2026Updated 2 weeks ago
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆23Feb 26, 2026Updated 3 months ago
- Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents☆31Dec 6, 2024Updated last year
- ☆67Feb 27, 2026Updated 3 months ago
- ☆42Jun 18, 2025Updated 11 months ago
- ☆20Dec 20, 2025Updated 5 months ago
- A Unified Framework for High-Performance and Extensible LLM Steering☆263Apr 30, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Memory-Based Instance-Level Adaptation for Cross-Domain Object Detection☆15Jul 11, 2024Updated last year
- ☆32Feb 5, 2025Updated last year
- Artifact evaluation of MobiSys25 SynCheck☆20Mar 24, 2025Updated last year
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆50Sep 15, 2025Updated 8 months ago
- ☆12Jan 19, 2025Updated last year
- ☆17Aug 16, 2019Updated 6 years ago
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Apr 7, 2026Updated 2 months ago
- ☆18May 11, 2025Updated last year
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆40Jun 4, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Tool to convert and import problems from Polygon into DOMjudge.☆35Mar 19, 2025Updated last year
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆45Jul 10, 2025Updated 11 months ago
- ☆34Jul 15, 2025Updated 10 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Dec 12, 2025Updated 6 months ago
- ☆89Dec 23, 2025Updated 5 months ago
- ☆56Mar 18, 2026Updated 2 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆52Jul 3, 2024Updated last year
- ☆34Sep 19, 2025Updated 8 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Vero: An Open RL Recipe for General Visual Reasoning☆123Jun 3, 2026Updated last week
- the newest version of llama3,source code explained line by line using Chinese☆22Apr 19, 2024Updated 2 years ago
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".☆68Updated this week
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29May 10, 2024Updated 2 years ago
- ☆56Jul 7, 2025Updated 11 months ago
- 基于 Open-AutoGLM 框架的纯手机端(Android 原生)AI 智能助手项目,专注于实现免手操作(Hands-Free)的手机自动化 Agent☆22Jan 9, 2026Updated 5 months ago
- [ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models☆89May 20, 2025Updated last year