abilliyb / Knowledge_Injection_Survey_PapersLinks
☆43Updated 2 months ago
Alternatives and similar repositories for Knowledge_Injection_Survey_Papers
Users that are interested in Knowledge_Injection_Survey_Papers are comparing it to the libraries listed below
Sorting:
- ☆136Updated last month
- A trainable user simulator☆34Updated 2 weeks ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆77Updated 8 months ago
- This is the code of MMOA-RAG.☆60Updated 2 months ago
- ☆57Updated 3 weeks ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆62Updated 5 months ago
- ☆23Updated last month
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆24Updated 2 weeks ago
- Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors☆26Updated 2 months ago
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".☆26Updated 2 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆145Updated 6 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆85Updated 6 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆22Updated 5 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆81Updated last month
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆67Updated 2 months ago
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆33Updated 7 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆113Updated 3 weeks ago
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆23Updated 3 months ago
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆31Updated last month
- ☆64Updated last month
- ☆39Updated 5 months ago
- Code and Data for Our NeurIPS 2024 paper "AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback"☆33Updated 8 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆47Updated 2 weeks ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆51Updated last month
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆73Updated 5 months ago
- ☆96Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆79Updated last month
- A Survey of Personalization: From RAG to Agent☆54Updated this week
- ☆47Updated 4 months ago
- ☆54Updated 4 months ago