[NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"
☆53May 21, 2025Updated 10 months ago
Alternatives and similar repositories for Web-Shepherd
Users that are interested in Web-Shepherd are comparing it to the libraries listed below
Sorting:
- [ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…☆17Jul 22, 2025Updated 8 months ago
- ☆12Aug 8, 2024Updated last year
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…☆27Oct 10, 2025Updated 5 months ago
- Official repo for StyleMe3D☆28Apr 22, 2025Updated 11 months ago
- This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models"☆30Oct 8, 2023Updated 2 years ago
- ☆68Mar 6, 2025Updated last year
- ☆22May 3, 2025Updated 10 months ago
- ☆13Aug 4, 2025Updated 7 months ago
- [ECML-PKDD2025] Visual Tree Search of Web Agent☆37Jul 18, 2025Updated 8 months ago
- ☆18Jun 13, 2025Updated 9 months ago
- ☆11Dec 6, 2024Updated last year
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)☆11Nov 15, 2023Updated 2 years ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Aug 7, 2025Updated 7 months ago
- ☆14Dec 25, 2024Updated last year
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆53Nov 5, 2024Updated last year
- [ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.☆13May 16, 2025Updated 10 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month
- Official code repository for "Web Agents with World Models [ICLR 2025]".☆29Mar 2, 2025Updated last year
- ☆25Aug 19, 2025Updated 7 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆23Mar 18, 2025Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆39Oct 7, 2025Updated 5 months ago
- ☆31Sep 27, 2024Updated last year
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025☆16Nov 24, 2024Updated last year
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated last year
- ☆13May 8, 2024Updated last year
- UnOfficial Gradio Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Y…☆16Sep 30, 2024Updated last year
- CoV: Chain-of-View Prompting for Spatial Reasoning☆52Jan 23, 2026Updated 2 months ago
- ☆21Jan 15, 2026Updated 2 months ago
- Advanced Embodied Intelligence Brain Model☆34Nov 5, 2025Updated 4 months ago
- The codebase for "Learning from Easy to Complex: Adaptive Multi-curricula Learning for Neural Dialogue Generation" (Cai et al., AAAI 2020…☆20Jun 18, 2024Updated last year
- [부스트캠프] 귀가노니 - 출퇴근길에 듣는 인공지능 뉴스 팟캐스트☆12Feb 28, 2022Updated 4 years ago
- ☆10Oct 8, 2021Updated 4 years ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…☆13Jun 16, 2023Updated 2 years ago
- [ICCV 2025] AdsQA: Towards Advertisement Video Understanding Arxiv: https://arxiv.org/abs/2509.08621☆34Oct 30, 2025Updated 4 months ago
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)☆15Jun 14, 2025Updated 9 months ago
- ☆11Jul 21, 2024Updated last year
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 9 months ago
- 基于PyTorch GPT-2的针对各种数据并行pretrain的研究代码.☆11Dec 16, 2022Updated 3 years ago