R1-like Computer-use Agent
☆89Mar 21, 2025Updated 11 months ago
Alternatives and similar repositories for STEVE-R1
Users that are interested in STEVE-R1 are comparing it to the libraries listed below
Sorting:
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆28Feb 5, 2025Updated last year
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 4 months ago
- ☆20Apr 16, 2025Updated 11 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆153May 29, 2025Updated 9 months ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆36Jan 16, 2026Updated 2 months ago
- ☆12Jul 16, 2025Updated 8 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆52Jul 15, 2025Updated 8 months ago
- GroundCUA☆69Mar 11, 2026Updated last week
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis☆68Jul 24, 2025Updated 7 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Apr 2, 2024Updated last year
- Code implementation of the paper accepted by IEEE TKDE2024: "Make Heterophilic Graphs Better Fit GNN: A Graph Rewiring Approach"☆111Dec 15, 2024Updated last year
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 10 months ago
- ☆20Apr 24, 2024Updated last year
- Generating Daylight-driven Architectural Design via Diffusion Models☆22Feb 9, 2025Updated last year
- ☆31Jul 3, 2025Updated 8 months ago
- Under construction☆13Jan 15, 2025Updated last year
- ☆47Apr 9, 2025Updated 11 months ago
- [CVPR 2023] Ref-NPR: Reference-Based Non-PhotoRealistic Radiance Fields☆126Jul 7, 2023Updated 2 years ago
- ☆321Sep 18, 2024Updated last year
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆79Nov 4, 2025Updated 4 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Dec 12, 2024Updated last year
- ☆20Oct 10, 2025Updated 5 months ago
- ☆20Nov 4, 2025Updated 4 months ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆392Jan 19, 2025Updated last year
- ☆133May 8, 2025Updated 10 months ago
- ☆26Mar 10, 2026Updated last week
- A Doctor for your data☆3,488Jan 14, 2025Updated last year
- [ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale☆25Jul 31, 2025Updated 7 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆45Nov 6, 2025Updated 4 months ago
- CVPR25☆27Jul 2, 2025Updated 8 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 7 months ago
- ☆35Jan 25, 2026Updated last month
- ☆13Aug 7, 2025Updated 7 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆277Aug 4, 2025Updated 7 months ago
- [ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning"☆51Feb 14, 2026Updated last month
- [CVPR 2026] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence☆66Jul 9, 2025Updated 8 months ago
- Long Context Transfer from Language to Vision☆402Mar 18, 2025Updated last year
- (ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale☆49Jun 4, 2025Updated 9 months ago
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆149Jan 3, 2026Updated 2 months ago