Zillwang/StepSearch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Zillwang/StepSearch)

Zillwang / StepSearch

EMNLP MAIN 2025 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization

☆75

Alternatives and similar repositories for StepSearch

Users that are interested in StepSearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SiliangZeng / Multi-Turn-RL-Agent
View on GitHub
☆139Jun 11, 2025Updated last year
KnowledgeXLab / O2-Searcher
View on GitHub
[TMLR 2026] A Searching-based Agent Model for Open-Domain Open-Ended Question Answering
☆39Jun 20, 2025Updated last year
GAIR-NLP / DeepResearcher
View on GitHub
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆781May 10, 2026Updated 2 months ago
QingFei1 / R-Search
View on GitHub
[ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning
☆35Jan 4, 2026Updated 6 months ago
CarnegieBin / GlobalRAG
View on GitHub
This is the Ofiicial repository for paper: GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learni…
☆16May 3, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PeterGriffinJin / Search-R1
View on GitHub
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,123Nov 13, 2025Updated 8 months ago
AgentR1 / Agent-R1
View on GitHub
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
☆1,551Jul 13, 2026Updated last week
EvolvingLMMs-Lab / multimodal-search-r1
View on GitHub
[ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…
☆469Apr 7, 2026Updated 3 months ago
OpenMatch / Gist-COCO
View on GitHub
This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".
☆13Feb 27, 2024Updated 2 years ago
GuoqingWang1 / IGPO
View on GitHub
[ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents
☆127Jul 14, 2026Updated last week
knowledgeable-embedding / knowledgeable-embedding
View on GitHub
Knowledgeable Embedding: Injecting dynamically updatable entity knowledge into embeddings to enhance RAG
☆15Aug 31, 2025Updated 10 months ago
ReTool-RL / ReTool
View on GitHub
☆382Aug 12, 2025Updated 11 months ago
RUCAIBox / R1-Searcher
View on GitHub
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
☆720Aug 5, 2025Updated 11 months ago
weiyifan1023 / AutoTIR
View on GitHub
Code and Data for Paper "AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning"
☆54Sep 4, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Applied-Machine-Learning-Lab / ReasonRAG
View on GitHub
Code implementation of NeurIPS'25 paper "Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning"
☆16Sep 19, 2025Updated 10 months ago
gowitheflow-1998 / RAR-b
View on GitHub
☆21Jul 19, 2024Updated 2 years ago
Yuqi-Zhou / LRAT
View on GitHub
The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.
☆53Jul 14, 2026Updated last week
Agent-RL / ReCall
View on GitHub
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…
☆1,412May 16, 2025Updated last year
thinkwee / AgentsMeetRL
View on GitHub
Awesome List for Agentic RL
☆1,701Jun 20, 2026Updated last month
OoDBag / VisTA
View on GitHub
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
☆27May 31, 2025Updated last year
TsinghuaC3I / SSRL
View on GitHub
SSRL: Self-Search Reinforcement Learning
☆210Aug 20, 2025Updated 11 months ago
xieincz / visualization-final-project
View on GitHub
本项目综合运用d3、echarts来完成可视化工作，实现了对nba两场比赛的可视化数据分析，包括球员运动轨迹、个人数据、传球次数以及得分位置等多种可交互式图表。通过可视化方法，我们能够进一步深入分析球队的具体情况，便于制定更佳的战术。
☆15Dec 19, 2022Updated 3 years ago
NVlabs / Tool-N1
View on GitHub
☆230Jun 2, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Zhaoyi-Li21 / creme
View on GitHub
[ACL 2024] "Understanding and Patching Compositional Reasoning in LLMs"
☆14Aug 28, 2024Updated last year
aimagelab / ReflectiVA
View on GitHub
[CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
☆56Jul 14, 2025Updated last year
princeton-pli / PruLong
View on GitHub
Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"
☆48Jul 29, 2025Updated 11 months ago
William030422 / Video-Sycophancy
View on GitHub
Implementation for paper Flattery in Motion: Benchmarking and Analyzing Sycophancy in Video-LLMs, which is accepted by ACL 2026 (main con…
☆16Oct 10, 2025Updated 9 months ago
wlzhang2020 / LLMTreeRec
View on GitHub
The implement of LLMTreeRec
☆14Dec 9, 2024Updated last year
Lux0926 / ASPRM
View on GitHub
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
☆10Mar 2, 2025Updated last year
ventr1c / RES-GCL
View on GitHub
An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023)
☆11Jan 22, 2024Updated 2 years ago
GAIR-NLP / lm-open-science-evaluation
View on GitHub
Reproducible and flexible LLM evaluations for scientific reasoning.
☆29Jul 23, 2025Updated 11 months ago
mll-lab-nu / RAGEN
View on GitHub
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,753Apr 14, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
DocTron-hub / Chart-R1
View on GitHub
Chart-R1: Chain-of-Thought Supervision and Reinforcement for Advanced Chart Reasoner
☆24Aug 7, 2025Updated 11 months ago
IBM / sql-rl-gen
View on GitHub
The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's …
☆25Sep 18, 2025Updated 10 months ago
Yingjia-Wan / FaStfact
View on GitHub
Code repo for FaStfact: Faster, Stronger Long-Form Factuality Evaluations in LLMs.
☆33Nov 5, 2025Updated 8 months ago
TaiMingLu / know-dont-tell
View on GitHub
☆19Oct 14, 2024Updated last year
vipulgupta1011 / CALM
View on GitHub
☆11Oct 2, 2023Updated 2 years ago
PlusLabNLP / Active-IT
View on GitHub
Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"
☆26Nov 16, 2023Updated 2 years ago
mahaozhe / SASR
View on GitHub
[ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR)
☆12Aug 26, 2025Updated 10 months ago