penghao-wu / GUI_ReflectionLinks
☆28Updated 4 months ago
Alternatives and similar repositories for GUI_Reflection
Users that are interested in GUI_Reflection are comparing it to the libraries listed below
Sorting:
- ☆51Updated 8 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆89Updated 7 months ago
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost☆105Updated 6 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆36Updated 2 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆148Updated 4 months ago
- Geometric-Mean Policy Optimization☆96Updated 2 months ago
- DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning☆166Updated 2 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆60Updated this week
- Official implementation of Browse-Master, a tool-augmented web-search agent.☆25Updated 4 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 5 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 7 months ago
- Resa: Transparent Reasoning Models via SAEs☆47Updated 3 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆128Updated 5 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Updated 5 months ago
- ☆20Updated 3 months ago
- ☆50Updated 11 months ago
- ☆32Updated 6 months ago
- ☆38Updated 2 months ago
- Process Reward Models That Think☆74Updated last month
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆56Updated 2 months ago
- ☆75Updated 2 months ago
- Official Repository of Native Parallel Reasoner☆96Updated last month
- ☆122Updated 3 months ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆25Updated 3 months ago
- ☆126Updated last week
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆91Updated 7 months ago
- Efficient Agent Training for Computer Use☆135Updated 4 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47Updated 8 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Updated 3 months ago