Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning
☆46Jun 24, 2025Updated last year
Alternatives and similar repositories for ReasonRAG
Users that are interested in ReasonRAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2025] Removal of Hallucination on Hallucination: Debate-Augmented RAG☆44Aug 4, 2025Updated 11 months ago
- ☆18Dec 23, 2025Updated 6 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆55Feb 10, 2025Updated last year
- Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition (IJCAI 2024)☆11Aug 18, 2024Updated last year
- ☆24Jul 2, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆26Jul 26, 2025Updated 11 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 8 months ago
- [ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning☆35Jan 4, 2026Updated 6 months ago
- Code for Robust Fine-tuning (RbFT)☆19Jan 31, 2025Updated last year
- ☆32May 27, 2025Updated last year
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆13Mar 27, 2025Updated last year
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- The code of Advancing Expert Specialization for Better MoE (NeurIPS2025 oral)☆34Jan 22, 2026Updated 5 months ago
- Implementation of Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. Paper: https://arxiv.org/abs/2404.06809☆22Oct 22, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations☆29Aug 1, 2024Updated last year
- A Comprehensive Library for Memory of LLM-based Agents.☆112May 13, 2025Updated last year
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated last year
- ☆11Jul 15, 2021Updated 4 years ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 9 months ago
- ☆37Feb 21, 2025Updated last year
- ☆16Sep 22, 2024Updated last year
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆30Apr 23, 2026Updated 2 months ago
- QGEval: A Benchmark for Question Generation Evaluation☆19Nov 7, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆80May 25, 2025Updated last year
- ☆28Jul 9, 2025Updated 11 months ago
- BusterX and BusterX++☆42Jun 16, 2026Updated 2 weeks ago
- Defeating the Training-Inference Mismatch via FP16☆196Nov 14, 2025Updated 7 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆130May 26, 2026Updated last month
- [LREC-COLING 2024] Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models☆55May 13, 2025Updated last year
- Official repository for RAG-Gym☆123Mar 4, 2025Updated last year
- Official Code for MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training (In ACL 2026 Main)☆43May 15, 2026Updated last month
- ☆21Jul 1, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Apr 7, 2026Updated 2 months ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆25Sep 25, 2025Updated 9 months ago
- llm langchain quick start☆16Jun 14, 2023Updated 3 years ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆26Dec 5, 2023Updated 2 years ago
- [COLING22] Text-to-Text Extraction and Verbalization of Biomedical Event Graphs☆10Nov 5, 2022Updated 3 years ago
- ☆11Jun 7, 2023Updated 3 years ago
- Code for Paper "Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation"☆12Feb 6, 2023Updated 3 years ago