sjtu-sai-agents / Browse-Master
Official implementation of Browse-Master, a tool-augmented web-search agent.
☆25 Updated 4 months ago
Alternatives and similar repositories for Browse-Master
Users who are interested in Browse-Master are comparing it to the repositories listed below.
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models. ☆73 Updated last year
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆36 Updated 11 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆52 Updated 3 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models ☆156 Updated 6 months ago
- ☆191 Updated 3 weeks ago
- ☆28 Updated 3 months ago
- Geometric-Mean Policy Optimization ☆96 Updated last month
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆62 Updated last year
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents ☆47 Updated 10 months ago
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost ☆104 Updated 5 months ago
- ☆64 Updated 2 months ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆28 Updated 3 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆69 Updated 7 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models" ☆58 Updated 3 weeks ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization ☆51 Updated 5 months ago
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis ☆68 Updated 5 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning ☆147 Updated 3 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆40 Updated last month
- Multimodal RewardBench ☆58 Updated 10 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning ☆72 Updated last year
- JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence ☆74 Updated last month
- ☆39 Updated 3 weeks ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆71 Updated 7 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? ☆36 Updated 6 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆63 Updated 2 months ago
- ☆36 Updated 3 months ago
- ☆69 Updated 7 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning ☆55 Updated 2 months ago
- Code for Paper InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models ☆43 Updated 6 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models" ☆40 Updated 6 months ago