R1-like Computer-use Agent
☆90Mar 21, 2025Updated last year
Alternatives and similar repositories for STEVE-R1
Users that are interested in STEVE-R1 are comparing it to the libraries listed below.
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆159May 29, 2025Updated last year
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆28Apr 14, 2026Updated 2 months ago
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 7 months ago
- ☆21Apr 16, 2025Updated last year
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆40Jan 16, 2026Updated 5 months ago
- ☆13Jul 16, 2025Updated 11 months ago
- Recommended VPNs for accessing China from abroad - updated 2026☆55Jun 3, 2026Updated 3 weeks ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆55Jul 15, 2025Updated 11 months ago
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis☆69Jul 24, 2025Updated 11 months ago
- ☆130Oct 3, 2025Updated 8 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated 2 years ago
- [COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?☆12Dec 3, 2024Updated last year
- Code implementation of the paper accepted by IEEE TKDE2024: "Make Heterophilic Graphs Better Fit GNN: A Graph Rewiring Approach"☆112Dec 15, 2024Updated last year
- ☆20Apr 24, 2024Updated 2 years ago
- ☆32Jul 3, 2025Updated 11 months ago
- Under construction☆13Jan 15, 2025Updated last year
- ☆47Apr 9, 2025Updated last year
- [CVPR 2023] Ref-NPR: Reference-Based Non-PhotoRealistic Radiance Fields☆126Jul 7, 2023Updated 2 years ago
- ☆23Apr 2, 2026Updated 2 months ago
- ☆323Sep 18, 2024Updated last year
- moodist☆28Apr 23, 2026Updated 2 months ago
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆90Nov 4, 2025Updated 7 months ago
- ☆19Nov 4, 2025Updated 7 months ago
- A Toolkit for building AI agent for web3.☆83Jan 16, 2025Updated last year
- ☆29May 13, 2025Updated last year
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆398Jan 19, 2025Updated last year
- ☆139May 8, 2025Updated last year
- ☆24Oct 10, 2025Updated 8 months ago
- Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)☆725Sep 24, 2025Updated 9 months ago
- A Doctor for your data☆3,481Jun 16, 2026Updated 2 weeks ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆46Nov 6, 2025Updated 7 months ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆49Jun 2, 2026Updated 3 weeks ago
- [ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale☆25Jul 31, 2025Updated 11 months ago
- CVPR25☆28Jul 2, 2025Updated 11 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 10 months ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆150Nov 4, 2025Updated 7 months ago
- This project uses wrist-worn sensor data—movement, temperature, and proximity—to distinguish body-focused repetitive behaviors (BFRBs) fr…☆81Oct 6, 2025Updated 8 months ago
- ☆13Apr 13, 2026Updated 2 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆282Aug 4, 2025Updated 10 months ago