sjtu-sai-agents / Browse-Master
Official implementation of Browse-Master, a tool-augmented web-search agent.
☆23 Updated 3 months ago
Alternatives and similar repositories for Browse-Master
Users who are interested in Browse-Master are comparing it to the repositories listed below.
- RM-R1: Unleashing the Reasoning Potential of Reward Models ☆154 Updated 5 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆51 Updated 2 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models. ☆73 Updated last year
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning ☆36 Updated 2 weeks ago
- JudgeLRM: Large Reasoning Models as a Judge ☆40 Updated last week
- EMNLP MAIN 2025 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization ☆45 Updated 3 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆36 Updated 10 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆66 Updated 6 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards" ☆55 Updated 2 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆62 Updated last year
- ☆66 Updated 6 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models" ☆40 Updated 5 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect… ☆131 Updated last month
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25] ☆95 Updated 8 months ago
- ☆22 Updated last month
- instruction-following benchmark for large reasoning models ☆45 Updated 4 months ago
- Test-time preference optimization (ICML 2025). ☆172 Updated 7 months ago
- ☆83 Updated last week
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆25 Updated 4 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆118 Updated 7 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆57 Updated 9 months ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆27 Updated 3 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning ☆73 Updated last year
- REverse-Engineered Reasoning for Open-Ended Generation ☆84 Updated 3 months ago
- SSRL: Self-Search Reinforcement Learning ☆158 Updated 4 months ago
- ☆51 Updated 10 months ago
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents" ☆89 Updated 2 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆62 Updated last month
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows" ☆117 Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆70 Updated 6 months ago