[AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615
☆64Nov 8, 2025Updated 6 months ago
Alternatives and similar repositories for GUI-RCPO
Users that are interested in GUI-RCPO are comparing it to the libraries listed below.
- ☆32Aug 11, 2025Updated 9 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆48Oct 20, 2025Updated 7 months ago
- ☆37Oct 9, 2025Updated 7 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 6 months ago
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆45Aug 10, 2025Updated 9 months ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆54May 5, 2026Updated 2 weeks ago
- ☆28Aug 19, 2025Updated 9 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 7 months ago
- ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆76Mar 9, 2026Updated 2 months ago
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆296Updated this week
- [ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139☆79Nov 10, 2025Updated 6 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated 11 months ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆418May 13, 2026Updated last week
- [ICLR 2026] SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models☆95Mar 9, 2026Updated 2 months ago
- Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents☆30Dec 6, 2024Updated last year
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆309Apr 15, 2026Updated last month
- Harbin Institute of Technology "Database Systems" course labs, Spring 2018☆11Jun 10, 2018Updated 7 years ago
- A Unified Framework for High-Performance and Extensible LLM Steering☆258Apr 30, 2026Updated 3 weeks ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆48Sep 15, 2025Updated 8 months ago
- Artifact evaluation of MobiSys25 SynCheck☆20Mar 24, 2025Updated last year
- ☆18May 11, 2025Updated last year
- Tool to convert and import problems from Polygon into DOMjudge.☆35Mar 19, 2025Updated last year
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆45Jul 10, 2025Updated 10 months ago
- ☆84Dec 23, 2025Updated 5 months ago
- ☆34Jul 15, 2025Updated 10 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Dec 12, 2025Updated 5 months ago
- ☆56Mar 18, 2026Updated 2 months ago
- Fine-tuning large language models with the DPO algorithm; simple and easy to get started.☆51Jul 3, 2024Updated last year
- ☆34Sep 19, 2025Updated 8 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- The newest version of Llama 3, with source code explained line by line in Chinese☆22Apr 19, 2024Updated 2 years ago
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".☆67May 18, 2026Updated last week
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29May 10, 2024Updated 2 years ago
- ☆56Jul 7, 2025Updated 10 months ago
- A purely on-device (native Android) AI assistant project based on the Open-AutoGLM framework, focused on a hands-free phone automation agent☆22Jan 9, 2026Updated 4 months ago
- [ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models☆89May 20, 2025Updated last year
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆49Sep 19, 2025Updated 8 months ago
- A small storytelling LLM running on the PS Vita☆29Jun 12, 2025Updated 11 months ago
- ☆102Feb 4, 2026Updated 3 months ago