Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning
☆45Jun 24, 2025Updated 9 months ago
Alternatives and similar repositories for ReasonRAG
Users that are interested in ReasonRAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Dec 23, 2025Updated 3 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆52Feb 10, 2025Updated last year
- Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition (IJCAI 2024)☆11Aug 18, 2024Updated last year
- ☆22Jul 2, 2025Updated 8 months ago
- ☆24Jul 26, 2025Updated 7 months ago
- ☆28Jan 4, 2026Updated 2 months ago
- ☆33May 27, 2025Updated 9 months ago
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- Implementation of Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. Paper: https://arxiv.org/abs/2404.06809☆22Oct 22, 2024Updated last year
- Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations☆29Aug 1, 2024Updated last year
- ☆27Feb 20, 2026Updated last month
- A Comprehensive Library for Memory of LLM-based Agents.☆107May 13, 2025Updated 10 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 8 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 6 months ago
- Terminal-Bench-Science: Evaluating AI Agents on Complex Real-World Scientific Workflows in the Terminal☆43Updated this week
- ☆36Feb 21, 2025Updated last year
- The code for ”T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval“☆22Jul 30, 2025Updated 7 months ago
- ☆23Jan 19, 2026Updated 2 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆127Nov 19, 2025Updated 4 months ago
- Code of the paper Relation-enhanced Negative Sampling for Multimodal Knowledge Graph Completion (ACM MM22))☆27May 22, 2024Updated last year
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆76May 25, 2025Updated 9 months ago
- Defeating the Training-Inference Mismatch via FP16☆183Nov 14, 2025Updated 4 months ago
- ☆14Mar 17, 2025Updated last year
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆120Jan 29, 2025Updated last year
- QGEval: A Benchmark for Question Generation Evaluation☆19Nov 7, 2024Updated last year
- Official Code for MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training☆38Oct 18, 2025Updated 5 months ago
- ☆28Jul 9, 2025Updated 8 months ago
- BusterX and BusterX++☆37Mar 9, 2026Updated 2 weeks ago
- Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."☆23Dec 23, 2024Updated last year
- [LREC-COLING 2024] Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models☆54May 13, 2025Updated 10 months ago
- Official repository for RAG-Gym☆122Mar 4, 2025Updated last year
- Control LLM☆22Apr 6, 2025Updated 11 months ago
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Sep 25, 2025Updated 6 months ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆22Sep 25, 2025Updated 6 months ago
- llm langchain quick start☆16Jun 14, 2023Updated 2 years ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆27Dec 5, 2023Updated 2 years ago
- Large-Vocabulary Continuous Sign Language Recognition, 2024☆15May 30, 2024Updated last year
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆59Oct 14, 2025Updated 5 months ago