Agent-RL / ReSearch

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

☆676

Alternatives and similar repositories for ReSearch:

Users who are interested in ReSearch are comparing it to the repositories listed below.

RUCAIBox / R1-Searcher
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
☆455 Updated this week
sunnynexus / Search-o1
Search-o1: Agentic Search-Enhanced Large Reasoning Models
☆805 Updated 3 weeks ago
RUCAIBox / Slow_Thinking_with_LLMs
A series of technical reports on Slow Thinking with LLMs
☆644 Updated last week
GAIR-NLP / DeepResearcher
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆244 Updated last week
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆802 Updated 4 months ago
0russwest0 / Agent-R1
☆381 Updated this week
zhentingqi / rStar
☆920 Updated 2 months ago
WooooDyy / AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…
☆449 Updated last month
RAGEN-AI / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆1,382 Updated this week
Qihoo360 / Light-R1
☆659 Updated last week
huggingface / Math-Verify
☆630 Updated 3 weeks ago
theworldofagents / Agentic-Reasoning
free and open OpenAI Deep Research
☆518 Updated 2 months ago
GAIR-NLP / LIMO
LIMO: Less is More for Reasoning
☆913 Updated 2 weeks ago
CraftJarvis / RAT
Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".
☆230 Updated 10 months ago
lqtrung1998 / mwp_ReFT
☆518 Updated 3 months ago
magpie-align / magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …
☆676 Updated last month
Gen-Verse / ReasonFlux
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates
☆373 Updated 2 weeks ago
eddycmu / demystify-long-cot
☆282 Updated last month
fate-ubw / RAGLAB
[EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
☆294 Updated 6 months ago
BytedTsinghua-SIA / DAPO
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,141 Updated last week
ADaM-BJTU / O1-CODER
An O1 Replication for Coding
☆334 Updated 4 months ago
microsoft / rStar
☆518 Updated last week
0russwest0 / Awesome-Agent-RL
☆135 Updated 3 weeks ago
AIDC-AI / Marco-o1
An Open Large Reasoning Model for Real-World Solutions
☆1,484 Updated last month
ByteDance-Seed / Seed-Thinking-v1.5
☆698 Updated this week
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆863 Updated last week
facebookresearch / swe-rl
Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆503 Updated last month
Reason-Wang / ToolGen
[ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
☆136 Updated 3 weeks ago
oneal2000 / PRAG
Code for Parametric RAG, SIGIR 2025 Full Paper
☆154 Updated last week
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆1,928 Updated last week