kyle8581/Web-Shepherd

[NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"

☆58

Alternatives and similar repositories for Web-Shepherd

Users interested in Web-Shepherd are comparing it to the repositories listed below.

aburns4 / textualforesight
☆12 · Aug 8, 2024 · Updated last year
SGI-2023 / 3D-Building-Classification
☆10 · Aug 16, 2024 · Updated last year
LFhase / HIGHT
[ICML 2025] Hierarchical Graph Tokenization for Molecule-Language Alignment
☆16 · Aug 18, 2025 · Updated 11 months ago
ZJU-ACES-ISE / ChatUITest
Under construction
☆13 · Jan 15, 2025 · Updated last year
amazon-science / PAE
☆70 · Mar 6, 2025 · Updated last year
wade3han / normlens
An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…
☆10 · May 9, 2024 · Updated 2 years ago
bcaitech1 / p3-mrc-team-ikyo
Naver Boostcamp AI Tech Stage 3: MRC (Machine Reading Comprehension)
☆10 · Jun 10, 2021 · Updated 5 years ago
2runo / dl_numpy
A deep learning library implemented in NumPy (supports automatic differentiation).
☆15 · May 4, 2021 · Updated 5 years ago
hkust-nlp / GUIMid
☆22 · May 3, 2025 · Updated last year
ASTRAL-Group / AlphaOne
[EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
☆89 · Jun 10, 2025 · Updated last year
microsoft / webgym
This project includes code for using the AsyncWebRL and WebGym frameworks to train web agent models.
☆46 · Jun 9, 2026 · Updated last month
kyle8581 / DialogueCoT
[EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)
☆11 · Nov 15, 2023 · Updated 2 years ago
ZhuHaoranEIS / Orthogonal-FGOD
☆11 · Mar 4, 2026 · Updated 4 months ago
microsoft / ExACT
☆52 · Jul 10, 2026 · Updated last week
baopj / Vid-Morp
☆12 · Dec 6, 2024 · Updated last year
baixianghuang / survey-authorship
Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…
☆19 · May 25, 2026 · Updated last month
baopj / E3M
[ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.
☆11 · Jul 16, 2024 · Updated 2 years ago
Reza-esfandiarpoor / the-mcp-company
TheMCPCompany: Creating General-purpose Agents with Task-specific Tools
☆16 · Dec 19, 2025 · Updated 7 months ago
kyle8581 / WMA-Agents
Official code repository for "Web Agents with World Models [ICLR 2025]".
☆31 · Mar 2, 2025 · Updated last year
ljang0 / videowebarena
☆14 · Dec 25, 2024 · Updated last year
fairyshine / Seal-Tools
The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…
☆57 · Nov 5, 2024 · Updated last year
NeuralAction / NeuralAction
Neural Action is a real-time CNN-based gaze tracking application that provides a human-machine interface to improve accessibility.
☆49 · Jun 5, 2020 · Updated 6 years ago
qiancheng0 / CREATOR
This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models"
☆31 · Oct 8, 2023 · Updated 2 years ago
BAAI-WuDao / EVA
☆25 · Sep 29, 2021 · Updated 4 years ago
limafang / BubbleRAG
Official source code repository for paper BubbleRAG.
☆16 · Jun 1, 2026 · Updated last month
test-time-interaction / TTI
☆76 · Jun 10, 2025 · Updated last year
D-Star-AI / KITE
KITE (Knowledge-Intensive Task Evaluation) is an end-to-end benchmark for RAG pipelines
☆24 · Aug 14, 2024 · Updated last year
WangWenhao0716 / PDF-Embedding
[NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"
☆18 · Oct 1, 2024 · Updated last year
jfc43 / MARS
MARS, a framework optimized for autonomous AI research
☆39 · May 19, 2026 · Updated 2 months ago
sanjibanc / agent_prm
☆60 · Feb 19, 2025 · Updated last year
FreedomIntelligence / Smurfs
Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning
☆15 · Jun 24, 2025 · Updated last year
boostcampaitech2 / final-project-level3-nlp-05
[Boostcamp] 귀가노니 - An AI news podcast to listen to on your commute
☆13 · Feb 28, 2022 · Updated 4 years ago
SLIT-AI / WRPO
[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
☆14 · Mar 17, 2025 · Updated last year
OpenDFM / MobA
🎮 Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…
☆28 · Oct 10, 2025 · Updated 9 months ago
QingFei1 / R-Search
[ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning
☆35 · Jan 4, 2026 · Updated 6 months ago
allenai / gpv2-web10k
Download Web-10K data by querying Bing Image Search
☆10 · Feb 1, 2022 · Updated 4 years ago
BiEchi / chipyard
☆10 · Oct 8, 2021 · Updated 4 years ago
AIGCResearch / styleme3d
Official repo for StyleMe3D
☆30 · Apr 22, 2025 · Updated last year
LaVi-Lab / FTTT
[ACL 2025] Official code for "Learning to Reason from Feedback at Test-Time".
☆13 · May 16, 2025 · Updated last year