[AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615
☆62Nov 8, 2025Updated 4 months ago
Alternatives and similar repositories for GUI-RCPO
Users that are interested in GUI-RCPO are comparing it to the libraries listed below.
- ☆32Aug 11, 2025Updated 7 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆47Oct 20, 2025Updated 5 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 4 months ago
- ☆37Oct 9, 2025Updated 5 months ago
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆43Aug 10, 2025Updated 7 months ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆50Feb 12, 2026Updated last month
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 5 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated 9 months ago
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆19Feb 26, 2026Updated 3 weeks ago
- An Advanced Basic Math Reasoning and Overthinking Evaluation Framework for LLMs☆12Jul 8, 2025Updated 8 months ago
- [ICLR 2026] SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models☆80Mar 9, 2026Updated 2 weeks ago
- Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents☆27Dec 6, 2024Updated last year
- ☆62Feb 27, 2026Updated 3 weeks ago
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆305Feb 2, 2026Updated last month
- VisPlay: Self-Evolving Vision-Language Models☆51Feb 25, 2026Updated last month
- ☆27Jun 18, 2025Updated 9 months ago
- Harbin Institute of Technology "Database Systems" course labs, Spring 2018☆11Jun 10, 2018Updated 7 years ago
- ☆19Dec 20, 2025Updated 3 months ago
- Official implementation of ICRA2024 paper "Sim-to-Real Grasp Detection with Global-to-Local RGB-D Adaptation"☆20May 10, 2024Updated last year
- ☆77Feb 5, 2026Updated last month
- ☆32Feb 5, 2025Updated last year
- ☆20Sep 28, 2020Updated 5 years ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆49Sep 15, 2025Updated 6 months ago
- Artifact evaluation of MobiSys25 SynCheck☆20Mar 24, 2025Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”.☆39Dec 31, 2024Updated last year
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Sep 25, 2025Updated 6 months ago
- The code of LLaVO☆19Oct 21, 2025Updated 5 months ago
- ☆71Dec 23, 2025Updated 3 months ago
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆45Jul 10, 2025Updated 8 months ago
- Code for "Simplifying Source-Free Domain Adaptation for Object Detection: Effective Self-Training Strategies and Performance Insights" (E…☆27Nov 27, 2024Updated last year
- ☆33Jul 15, 2025Updated 8 months ago
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"☆409Jan 29, 2026Updated last month
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Dec 12, 2025Updated 3 months ago
- Fine-tuning large language models with the DPO algorithm; simple and easy to get started.☆51Jul 3, 2024Updated last year
- ☆55Mar 18, 2026Updated last week
- ☆32Sep 19, 2025Updated 6 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 3 months ago
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆114Jul 17, 2025Updated 8 months ago
- The newest version of Llama 3, with the source code explained line by line in Chinese☆22Apr 19, 2024Updated last year