QingFei1/R-Search

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QingFei1/R-Search)

QingFei1 / R-Search

[ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning

☆35

Alternatives and similar repositories for R-Search

Users that are interested in R-Search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KnowledgeXLab / O2-Searcher
View on GitHub
[TMLR 2026] A Searching-based Agent Model for Open-Domain Open-Ended Question Answering
☆39Jun 20, 2025Updated last year
EvolvingLMMs-Lab / multimodal-search-r1
View on GitHub
[ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…
☆469Apr 7, 2026Updated 3 months ago
syr-cn / AutoRefine
View on GitHub
[NeurIPS 2025] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning
☆142Jun 25, 2026Updated 3 weeks ago
GAIR-NLP / DeepResearcher
View on GitHub
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆781May 10, 2026Updated 2 months ago
bigai-nlco / RuleReasoner
View on GitHub
[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
☆39Feb 25, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
StibiumT16 / Robust-Fine-tuning
View on GitHub
Code for Robust Fine-tuning (RbFT)
☆19Jan 31, 2025Updated last year
stanford-futuredata / Baleen
View on GitHub
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21)
☆48Dec 27, 2021Updated 4 years ago
calubkk / RAAT
View on GitHub
[ACL-2024]Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training
☆43Oct 28, 2024Updated last year
DualityRL / multi-attempt
View on GitHub
☆19Mar 10, 2025Updated last year
WxxShirley / MoLoRAG
View on GitHub
[EMNLP 2025] Official implementation for paper "MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval"
☆26Mar 17, 2026Updated 4 months ago
XueruiSu / Trust-Region-Preference-Approximation
View on GitHub
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15Jun 28, 2025Updated last year
ZhaolinGao / REFUEL
View on GitHub
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
☆25Oct 8, 2024Updated last year
ZhangXJ199 / EDGE-GRPO
View on GitHub
Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity
☆22Aug 28, 2025Updated 10 months ago
yongchao98 / R1-Code-Interpreter
View on GitHub
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
☆44Feb 9, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Xinyi-0724 / Search-R1-Qwen3
View on GitHub
Enhanced Search-R1 Implementation: Improved Compatibility and Modern Framework Integration
☆28Dec 8, 2025Updated 7 months ago
ai-wand / concise-reasoning
View on GitHub
Concise Reasoning via Reinforcement Learning
☆13Apr 16, 2025Updated last year
microsoft / x-reasoner
View on GitHub
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆49Feb 4, 2026Updated 5 months ago
RAG-Gym / RAG-Gym
View on GitHub
Official repository for RAG-Gym
☆124Jul 14, 2026Updated last week
RUCAIBox / SimpleDeepSearcher
View on GitHub
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
☆120Jun 3, 2025Updated last year
derenlei / FactCG
View on GitHub
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data (NAACL 2025)
☆17Jul 14, 2025Updated last year
thangylvp / MA-RAG
View on GitHub
☆23Aug 20, 2025Updated 11 months ago
dropbox / low-rank-llama2
View on GitHub
Low-Rank Llama Custom Training
☆23Mar 27, 2024Updated 2 years ago
MiliLab / REX-RAG
View on GitHub
Official repo for "REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-Augmented Generation"
☆35Sep 28, 2025Updated 9 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
BeastyZ / ConvSearch-R1
View on GitHub
Official repo for paper ConvSearch-R1
☆62Nov 4, 2025Updated 8 months ago
THU-KEG / LRM-FactEval
View on GitHub
☆17Jun 25, 2025Updated last year
test-time-interaction / TTI
View on GitHub
☆76Jun 10, 2025Updated last year
tangzhy / RealCritic
View on GitHub
☆15Jan 27, 2025Updated last year
YBYBZhang / Tool-R1
View on GitHub
Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"
☆20Sep 16, 2025Updated 10 months ago
BY571 / SCoRe
View on GitHub
SCoRe: Training Language Models to Self-Correct via Reinforcement Learning
☆16May 14, 2026Updated 2 months ago
RM-R1-UIUC / RM-R1
View on GitHub
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167Jun 26, 2025Updated last year
zihou98 / Whole-Slide-Image
View on GitHub
Working note for WSI analysis
☆10Apr 3, 2023Updated 3 years ago
Shujun-He / Google-Brain-Ventilator
View on GitHub
☆11Nov 11, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
allenai / hybrid-preferences
View on GitHub
Learning to route instances for Human vs AI Feedback (ACL Main '25)
☆29Jul 23, 2025Updated 11 months ago
zhiyuns / UNITPathSSL
View on GitHub
Official PyTorch implementation of the TMI paper "Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for…
☆16Mar 13, 2024Updated 2 years ago
Da1yuqin / EviNoteRAG
View on GitHub
Welcome! 😊 This is the official code release of EviNote-RAG, and we’re happy to share it with the community.
☆48Jun 4, 2026Updated last month
mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
UKPLab / PeerQA
View on GitHub
Code and Data for PeerQA: A Scientific Question Answering Dataset from Peer Reviews, NAACL 2025 https://aclanthology.org/2025.naacl-long.…
☆15Jun 1, 2026Updated last month
wujwyi / PA-RAG
View on GitHub
[NAACL 2025 Main Conference] PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
☆27Mar 29, 2025Updated last year