ZJU-REAL / gui-rcpoView external linksLinks
[AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615
☆59Nov 8, 2025Updated 3 months ago
Alternatives and similar repositories for gui-rcpo
Users that are interested in gui-rcpo are comparing it to the libraries listed below
Sorting:
- ☆32Aug 11, 2025Updated 6 months ago
- ☆36Oct 9, 2025Updated 4 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆45Oct 20, 2025Updated 3 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 3 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆39Sep 30, 2025Updated 4 months ago
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆42Aug 10, 2025Updated 6 months ago
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆68Jan 8, 2026Updated last month
- ☆24Aug 19, 2025Updated 5 months ago
- ☆19Dec 20, 2025Updated last month
- Artifact evaluation of MobiSys25 SynCheck☆19Mar 24, 2025Updated 10 months ago
- ☆25Jun 18, 2025Updated 7 months ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆49Sep 15, 2025Updated 5 months ago
- ☆33Jul 15, 2025Updated 7 months ago
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆34Sep 25, 2025Updated 4 months ago
- ☆55Updated this week
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆44Jul 10, 2025Updated 7 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Dec 12, 2025Updated 2 months ago
- ☆50Sep 18, 2025Updated 4 months ago
- VisPlay: Self-Evolving Vision-Language Models☆44Updated this week
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated 8 months ago
- ☆36Feb 2, 2026Updated 2 weeks ago
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆37Jun 4, 2025Updated 8 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Apr 19, 2024Updated last year
- ☆54Jul 7, 2025Updated 7 months ago
- Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning…☆46Aug 4, 2025Updated 6 months ago
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆109Jul 17, 2025Updated 6 months ago
- nof0 of AI-Trader,A 股,港股,美股自动交易,并实时跟踪☆74Nov 5, 2025Updated 3 months ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆45Sep 19, 2025Updated 4 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆44Sep 27, 2025Updated 4 months ago
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- Offical implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)☆82Jul 26, 2025Updated 6 months ago
- [ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models☆90May 20, 2025Updated 8 months ago
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"☆401Jan 29, 2026Updated 2 weeks ago
- ☆100Apr 1, 2025Updated 10 months ago
- Your command-line, context-aware chatbot for instant codebase insights & more ✨☆16May 30, 2024Updated last year
- ☆12Nov 21, 2025Updated 2 months ago
- 🔒 World's most secure P2P messenger. End-to-end encrypted, zero-server architecture, quantum-resistant roadmap. WebRTC direct connection…☆24Jan 7, 2026Updated last month
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29May 10, 2024Updated last year
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆56Jun 21, 2025Updated 7 months ago