[NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"
☆55May 21, 2025Updated last year
Alternatives and similar repositories for Web-Shepherd
Users that are interested in Web-Shepherd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…☆18Jul 22, 2025Updated 11 months ago
- ☆12Aug 8, 2024Updated last year
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…☆28Oct 10, 2025Updated 8 months ago
- Official repo for StyleMe3D☆30Apr 22, 2025Updated last year
- Under construction☆13Jan 15, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Jul 4, 2024Updated last year
- This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models"☆31Oct 8, 2023Updated 2 years ago
- ☆70Mar 6, 2025Updated last year
- ☆22May 3, 2025Updated last year
- [ECML-PKDD2025] Visual Tree Search of Web Agent☆37Jul 18, 2025Updated 11 months ago
- ☆18Jun 13, 2025Updated last year
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆47Aug 7, 2025Updated 10 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆57Nov 5, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.☆13May 16, 2025Updated last year
- ☆28Aug 19, 2025Updated 10 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- [ICLR 2026] AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆49Apr 17, 2026Updated 2 months ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆35Jun 23, 2025Updated last year
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025☆18Nov 24, 2024Updated last year
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated last year
- Official code repository for "Web Agents with World Models [ICLR 2025]".☆30Mar 2, 2025Updated last year
- ☆14May 8, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆155May 14, 2025Updated last year
- Advanced Embodied Intelligence Brain Model☆37Nov 5, 2025Updated 7 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆24Mar 18, 2025Updated last year
- The codebase for "Learning from Easy to Complex: Adaptive Multi-curricula Learning for Neural Dialogue Generation" (Cai et al., AAAI 2020…☆20Jun 18, 2024Updated 2 years ago
- ☆10Oct 8, 2021Updated 4 years ago
- Official code repo for NeurIPS 2025 Spotlight paper, "Debate or Vote: Which Yields Better Decisions in Multi-Agent LLMs?"☆79Oct 15, 2025Updated 8 months ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…☆14Jun 16, 2023Updated 3 years ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆63Apr 7, 2026Updated 2 months ago
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)☆15Jun 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Responsible Robotic Manipulation☆16Aug 31, 2025Updated 10 months ago
- ☆11Jul 21, 2024Updated last year
- 基于PyTorch GPT-2的针对各种数据并行pretrain的研究代码.☆11Dec 16, 2022Updated 3 years ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆32Jun 3, 2025Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆149Nov 26, 2024Updated last year
- Collection of papers about video-audio understanding☆25Dec 26, 2025Updated 6 months ago
- (ICLR 2025) The Official Code Repository for GUI-World.☆69Dec 18, 2024Updated last year