OSU-NLP-Group / ExplorerLinks
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
☆17Updated last month
Alternatives and similar repositories for Explorer
Users that are interested in Explorer are comparing it to the libraries listed below
Sorting:
- ☆21Updated 4 months ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆80Updated 5 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆122Updated 3 months ago
- ☆59Updated 3 months ago
- An Illusion of Progress? Assessing the Current State of Web Agents☆83Updated last month
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆143Updated 9 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆149Updated 10 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆83Updated 3 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆20Updated last month
- ☆19Updated 6 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆56Updated 3 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 8 months ago
- Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge☆78Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 3 months ago
- ☆48Updated 4 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆61Updated 3 months ago
- ☆49Updated 10 months ago
- ☆90Updated 3 weeks ago
- ☆33Updated 3 weeks ago
- ☆98Updated 3 weeks ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆110Updated 4 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆69Updated 4 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆77Updated last year
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆96Updated 5 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆79Updated 5 months ago
- A repo for open research on building large reasoning models☆102Updated this week
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆67Updated 2 months ago
- ☆45Updated this week
- instruction-following benchmark for large reasoning models☆40Updated last month
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆37Updated last month