Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning
☆45 Jun 24, 2025 Updated 9 months ago
Alternatives and similar repositories for ReasonRAG
Users who are interested in ReasonRAG are comparing it to the repositories listed below.
- [ACL 2025] Removal of Hallucination on Hallucination: Debate-Augmented RAG ☆39 Aug 4, 2025 Updated 8 months ago
- Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition (IJCAI 2024) ☆11 Aug 18, 2024 Updated last year
- ☆22 Jul 2, 2025 Updated 9 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆31 Oct 9, 2025 Updated 6 months ago
- ☆28 Jan 4, 2026 Updated 3 months ago
- Code for Robust Fine-tuning (RbFT) ☆17 Jan 31, 2025 Updated last year
- Math24o: High School Olympiad Mathematics Chinese Benchmark ☆11 Mar 27, 2025 Updated last year
- ☆33 May 27, 2025 Updated 10 months ago
- Code for the "Long Context Needs Some R&R" paper. ☆12 Mar 11, 2024 Updated 2 years ago
- Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations ☆29 Aug 1, 2024 Updated last year
- ☆29 Feb 20, 2026 Updated last month
- A Comprehensive Library for Memory of LLM-based Agents. ☆110 May 13, 2025 Updated 11 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning ☆15 Jun 28, 2025 Updated 9 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech" ☆13 Jul 27, 2023 Updated 2 years ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025). ☆13 Sep 22, 2025 Updated 6 months ago
- ☆15 Sep 22, 2024 Updated last year
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search" ☆23 Updated this week
- The code for "T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval" ☆22 Jul 30, 2025 Updated 8 months ago
- QGEval: A Benchmark for Question Generation Evaluation ☆19 Nov 7, 2024 Updated last year
- Code of the paper Relation-enhanced Negative Sampling for Multimodal Knowledge Graph Completion (ACM MM22) ☆27 May 22, 2024 Updated last year
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆76 May 25, 2025 Updated 10 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering ☆120 Jan 29, 2025 Updated last year
- Defeating the Training-Inference Mismatch via FP16 ☆188 Nov 14, 2025 Updated 5 months ago
- ☆14 Mar 17, 2025 Updated last year
- UR2: Unify RAG and Reasoning through Reinforcement Learning ☆127 Nov 19, 2025 Updated 4 months ago
- ☆28 Jul 9, 2025 Updated 9 months ago
- Official Code for MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training (In ACL 2026 Main) ☆38 Apr 7, 2026 Updated last week
- BusterX and BusterX++ ☆37 Mar 9, 2026 Updated last month
- [LREC-COLING 2024] Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models ☆54 May 13, 2025 Updated 11 months ago
- Official repository for RAG-Gym ☆121 Mar 4, 2025 Updated last year
- Control LLM ☆22 Apr 6, 2025 Updated last year
- PDF multimodal RAG question answering ☆28 Feb 26, 2025 Updated last year
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind" ☆23 Sep 25, 2025 Updated 6 months ago
- llm langchain quick start ☆16 Jun 14, 2023 Updated 2 years ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training ☆26 Dec 5, 2023 Updated 2 years ago
- Large-Vocabulary Continuous Sign Language Recognition, 2024 ☆16 May 30, 2024 Updated last year
- [COLING22] Text-to-Text Extraction and Verbalization of Biomedical Event Graphs ☆10 Nov 5, 2022 Updated 3 years ago
- DeepSeek R1 distilled into smaller OSS models ☆17 Dec 2, 2025 Updated 4 months ago
- ☆11 Jun 7, 2023 Updated 2 years ago