Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning
☆46Jun 24, 2025Updated 11 months ago
Alternatives and similar repositories for ReasonRAG
Users that are interested in ReasonRAG are comparing it to the libraries listed below.
- ☆18Dec 23, 2025Updated 5 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆54Feb 10, 2025Updated last year
- ☆24Jul 2, 2025Updated 11 months ago
- ☆25Jul 26, 2025Updated 10 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 8 months ago
- [ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning☆33Jan 4, 2026Updated 5 months ago
- Code for Robust Fine-tuning (RbFT)☆18Jan 31, 2025Updated last year
- ☆32May 27, 2025Updated last year
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆12Mar 27, 2025Updated last year
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- The code of Advancing Expert Specialization for Better MoE (NeurIPS2025 oral)☆32Jan 22, 2026Updated 4 months ago
- Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations☆29Aug 1, 2024Updated last year
- A Comprehensive Library for Memory of LLM-based Agents.☆112May 13, 2025Updated last year
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 11 months ago
- ☆11Jul 15, 2021Updated 4 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- ☆15Jul 6, 2022Updated 3 years ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 8 months ago
- ☆36Feb 21, 2025Updated last year
- ☆16Sep 22, 2024Updated last year
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆29Apr 23, 2026Updated last month
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆79May 25, 2025Updated last year
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆122Jan 29, 2025Updated last year
- ☆16Mar 17, 2025Updated last year
- ☆28Jul 9, 2025Updated 11 months ago
- Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."☆24Dec 23, 2024Updated last year
- Defeating the Training-Inference Mismatch via FP16☆194Nov 14, 2025Updated 7 months ago
- [LREC-COLING 2024] Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models☆55May 13, 2025Updated last year
- Official repository for RAG-Gym☆123Mar 4, 2025Updated last year
- Official Code for MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training (In ACL 2026 Main)☆43May 15, 2026Updated 3 weeks ago
- Control LLM☆23Apr 6, 2025Updated last year
- The code for "T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval"☆31Jul 30, 2025Updated 10 months ago
- ☆21Jul 1, 2024Updated last year
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Apr 7, 2026Updated 2 months ago
- Code of the paper Relation-enhanced Negative Sampling for Multimodal Knowledge Graph Completion (ACM MM22)☆32May 22, 2024Updated 2 years ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆25Sep 25, 2025Updated 8 months ago
- llm langchain quick start☆16Jun 14, 2023Updated 3 years ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆26Dec 5, 2023Updated 2 years ago
- Large-Vocabulary Continuous Sign Language Recognition, 2024☆16May 30, 2024Updated 2 years ago