Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning
☆45Jun 24, 2025Updated 11 months ago
Alternatives and similar repositories for ReasonRAG
Users that are interested in ReasonRAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2025] Removal of Hallucination on Hallucination: Debate-Augmented RAG☆42Aug 4, 2025Updated 9 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆54Feb 10, 2025Updated last year
- Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition (IJCAI 2024)☆11Aug 18, 2024Updated last year
- ☆23Jul 2, 2025Updated 10 months ago
- ☆24Jul 26, 2025Updated 9 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 7 months ago
- [ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning☆31Jan 4, 2026Updated 4 months ago
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- ☆33May 27, 2025Updated 11 months ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆12Mar 27, 2025Updated last year
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- The code of Advancing Expert Specialization for Better MoE (NeurIPS2025 oral)☆31Jan 22, 2026Updated 4 months ago
- Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations☆29Aug 1, 2024Updated last year
- A Comprehensive Library for Memory of LLM-based Agents.☆112May 13, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 10 months ago
- ☆11Jul 15, 2021Updated 4 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- ☆15Jul 6, 2022Updated 3 years ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 8 months ago
- ☆36Feb 21, 2025Updated last year
- ☆16Sep 22, 2024Updated last year
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆29Apr 23, 2026Updated last month
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆78May 25, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆121Jan 29, 2025Updated last year
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆129Apr 26, 2026Updated 3 weeks ago
- ☆28Jul 9, 2025Updated 10 months ago
- Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."☆23Dec 23, 2024Updated last year
- Official repository for RAG-Gym☆123Mar 4, 2025Updated last year
- Code of the paper Relation-enhanced Negative Sampling for Multimodal Knowledge Graph Completion (ACM MM22))☆30May 22, 2024Updated 2 years ago
- Control LLM☆23Apr 6, 2025Updated last year
- The code for ”T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval“☆29Jul 30, 2025Updated 9 months ago
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Apr 7, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆23Sep 25, 2025Updated 8 months ago
- llm langchain quick start☆16Jun 14, 2023Updated 2 years ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆26Dec 5, 2023Updated 2 years ago
- Large-Vocabulary Continuous Sign Language Recognition, 2024☆16May 30, 2024Updated last year
- [COLING22] Text-to-Text Extraction and Verbalization of Biomedical Event Graphs☆10Nov 5, 2022Updated 3 years ago
- DeepSeek R1 distilled into smaller OSS models for hobbyist☆17Dec 2, 2025Updated 5 months ago
- ☆11Jun 7, 2023Updated 2 years ago