sjtu-sai-agents / Browse-Master
Official implementation of Browse-Master, a tool-augmented web-search agent.
☆25 Updated 4 months ago
Alternatives and similar repositories for Browse-Master
Users who are interested in Browse-Master are comparing it to the libraries listed below.
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆36 Updated 11 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆52 Updated 3 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models. ☆73 Updated last year
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning ☆55 Updated 2 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆40 Updated last month
- Geometric-Mean Policy Optimization ☆96 Updated last month
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents ☆47 Updated 10 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models ☆156 Updated 6 months ago
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost ☆104 Updated 5 months ago
- ☆191 Updated 3 weeks ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM" ☆16 Updated 2 months ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆28 Updated 3 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models" ☆40 Updated 6 months ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs ☆50 Updated last month
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models" ☆58 Updated 3 weeks ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆62 Updated last year
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis ☆68 Updated 5 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25] ☆95 Updated 9 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆63 Updated 2 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆71 Updated 7 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆69 Updated 7 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning ☆147 Updated 3 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models ☆54 Updated 8 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios" ☆27 Updated 6 months ago
- ☆47 Updated 3 months ago
- ☆32 Updated 5 months ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas ☆97 Updated 3 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement ☆126 Updated 5 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task ☆36 Updated 8 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning ☆73 Updated last year