R1-like Computer-use Agent
☆89Mar 21, 2025Updated 11 months ago
Alternatives and similar repositories for STEVE-R1
Users that are interested in STEVE-R1 are comparing it to the libraries listed below
Sorting:
- ☆12Jul 16, 2025Updated 7 months ago
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆27Feb 5, 2025Updated last year
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 3 months ago
- ☆20Apr 16, 2025Updated 10 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆148May 29, 2025Updated 9 months ago
- ☆20Nov 4, 2025Updated 3 months ago
- ☆30Jul 3, 2025Updated 7 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Apr 2, 2024Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆51Jul 15, 2025Updated 7 months ago
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis☆68Jul 24, 2025Updated 7 months ago
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆75Nov 4, 2025Updated 3 months ago
- ☆20Apr 24, 2024Updated last year
- ☆66Jul 8, 2025Updated 7 months ago
- Improving transparency of large language models' reasoning☆14Nov 25, 2025Updated 3 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 8 months ago
- For ACL25 paper "WAFFLE: Multi-Modal Model for Automated Front-End Development" - by Shanchao Liang and Nan Jiang and Shangshu Qian and L…☆11May 28, 2025Updated 9 months ago
- ☆26Jul 29, 2025Updated 7 months ago
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- ☆73May 23, 2025Updated 9 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 6 months ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆133Nov 4, 2025Updated 3 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- ☆34Jan 25, 2026Updated last month
- Under construction☆13Jan 15, 2025Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- GroundCUA☆68Dec 24, 2025Updated 2 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆34Nov 10, 2025Updated 3 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Dec 12, 2024Updated last year
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 5 months ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆34Jan 16, 2026Updated last month
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- Advancing the frontier of efficient AI☆53Feb 10, 2026Updated 2 weeks ago
- moodist☆24Feb 20, 2026Updated last week
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…☆17May 27, 2024Updated last year
- ☆24May 23, 2025Updated 9 months ago
- ☆23Jan 28, 2026Updated 3 weeks ago
- XmodelLM☆38Nov 19, 2024Updated last year
- [ICLR 2025] Dissecting adversarial robustness of multimodal language model agents☆124Feb 19, 2025Updated last year