EMNLP MAIN 2025 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization
☆60Sep 13, 2025Updated 6 months ago
Alternatives and similar repositories for StepSearch
Users that are interested in StepSearch are comparing it to the libraries listed below
Sorting:
- ☆120Jun 11, 2025Updated 9 months ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆31Mar 5, 2026Updated 2 weeks ago
- ☆26Jan 25, 2026Updated last month
- Implementation of self-certainty as an extention of ZeroEval Project☆36May 31, 2025Updated 9 months ago
- Official implementation of Browse-Master, a tool-augmented web-search agent.☆27Aug 22, 2025Updated 6 months ago
- Ready to run PyTorch implementation of Data2Vec 2.0: Highly efficient self-supervised representation learning for vision, speech and text…☆16Mar 29, 2023Updated 2 years ago
- An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023)☆11Jan 22, 2024Updated 2 years ago
- ☆27Feb 12, 2026Updated last month
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆208Dec 11, 2025Updated 3 months ago
- The supplementary material for the paper "Fine-tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code R…☆16Aug 12, 2024Updated last year
- a within-document event coreference resolution system, trained and evaluated on the KBP corpus.☆10May 15, 2023Updated 2 years ago
- This is the official implementation of ICML 2025 paper MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding.☆22Jul 22, 2025Updated 7 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆700Aug 5, 2025Updated 7 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆159Sep 19, 2025Updated 6 months ago
- ☆16Sep 25, 2025Updated 5 months ago
- Segment a HTML document into structural data☆12Jan 15, 2019Updated 7 years ago
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Nov 27, 2023Updated 2 years ago
- ☆14Oct 3, 2024Updated last year
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆12Apr 4, 2025Updated 11 months ago
- ☆304Aug 12, 2025Updated 7 months ago
- Scorpius: Poisoning scientific knowledge using large language models☆11Aug 3, 2024Updated last year
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆17Jun 12, 2024Updated last year
- AgentHub is the LLM API Hub for the Agent era, built for high-precision autonomous agents. (GPT-5.4/Claude 4.6/Gemini 3.1)☆65Mar 12, 2026Updated last week
- 多因子打分选股☆13Jan 12, 2022Updated 4 years ago
- Codes and data for AAAI-24 paper "Advancing Spatial Reasoning in Large Language Models: An In-depth Evaluation and Enhancement Using the …☆14Apr 23, 2024Updated last year
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆29Nov 4, 2025Updated 4 months ago
- Code repo for FaStfact: Faster, Stronger Long-Form Factuality Evaluations in LLMs.☆32Nov 5, 2025Updated 4 months ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆45Jun 24, 2025Updated 8 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆194Sep 13, 2025Updated 6 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆53Jul 3, 2024Updated last year
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- The code repository for the paper "A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Cha…☆30Jun 2, 2025Updated 9 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆129Sep 28, 2025Updated 5 months ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆63Jan 23, 2026Updated last month
- ☆52Oct 10, 2024Updated last year
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".☆38Aug 22, 2025Updated 7 months ago
- ☆22Nov 11, 2024Updated last year
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 3 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆90Jul 13, 2025Updated 8 months ago