abilliyb / Knowledge_Injection_Survey_PapersLinks
β60Updated 6 months ago
Alternatives and similar repositories for Knowledge_Injection_Survey_Papers
Users that are interested in Knowledge_Injection_Survey_Papers are comparing it to the libraries listed below
Sorting:
- π§Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learningβ289Updated 3 weeks ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"β79Updated 2 weeks ago
- β69Updated 5 months ago
- This is the code of MMOA-RAG.β86Updated 6 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuningβ153Updated 10 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generationβ133Updated 9 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.β79Updated 2 weeks ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."β92Updated last year
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learningβ65Updated 5 months ago
- LLM for Scientific Research Surveyβ113Updated 9 months ago
- β165Updated last month
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language β¦β138Updated 6 months ago
- The demo, code and data of FollowRAGβ75Updated 4 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving theβ¦β169Updated 4 months ago
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".β27Updated 2 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large β¦β97Updated 10 months ago
- β162Updated 9 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.β63Updated 3 weeks ago
- RM-R1: Unleashing the Reasoning Potential of Reward Modelsβ148Updated 4 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agentsβ177Updated 2 weeks ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.β132Updated 7 months ago
- A Survey of Personalization: From RAG to Agentβ80Updated 3 months ago
- β26Updated 7 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reasoβ¦β129Updated 8 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward modelβ¦β58Updated 5 months ago
- A curated paper list on LLM reasoning.β89Updated last year
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Finβ¦β37Updated 11 months ago
- Official repository for RAG-Gymβ115Updated 8 months ago
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generationβ45Updated last month
- EMNLP MAIN 2025 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimizationβ43Updated 2 months ago