wlzhang2020 / ReasonRAG
Source code for the paper "Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning"
☆22Updated this week
Alternatives and similar repositories for ReasonRAG
Users who are interested in ReasonRAG are comparing it to the repositories listed below.
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆19Updated 7 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆22Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆26Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆80Updated last year
- ☆17Updated 11 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆41Updated last year
- [Findings of EMNLP'2024] Unified Active Retrieval for Retrieval Augmented Generation☆21Updated 8 months ago
- Towards Systematic Measurement for Long Text Quality☆35Updated 9 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆49Updated last year
- Trending projects & awesome papers about data-centric LLM studies.☆36Updated 2 weeks ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆50Updated 5 months ago
- Code for the paper "RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement"☆32Updated last year
- Test-time compute in information retrieval☆30Updated last month
- ☆16Updated 10 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆38Updated last year
- [COLM'24] "How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?"☆22Updated 7 months ago
- ☆47Updated 5 months ago
- ☆14Updated last year
- ☆16Updated last year
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Updated last year
- ☆18Updated this week
- ☆29Updated 5 months ago
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆24Updated 6 months ago
- ☆57Updated 7 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆43Updated 5 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆69Updated 9 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆57Updated 3 weeks ago
- ☆19Updated 2 years ago
- Automatic prompt optimization framework for multi-step agent tasks.☆31Updated 6 months ago