abilliyb / Knowledge_Injection_Survey_PapersLinks
☆44Updated 2 months ago
Alternatives and similar repositories for Knowledge_Injection_Survey_Papers
Users that are interested in Knowledge_Injection_Survey_Papers are comparing it to the libraries listed below
Sorting:
- ☆140Updated 2 months ago
- ☆67Updated last month
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆68Updated 2 weeks ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆117Updated 5 months ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆225Updated last week
- ☆58Updated last month
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆88Updated 7 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆147Updated 7 months ago
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆35Updated 8 months ago
- Test-time preferenece optimization (ICML 2025).☆155Updated 3 months ago
- ☆103Updated 8 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆91Updated 2 months ago
- This is the code of MMOA-RAG.☆64Updated 2 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆80Updated 2 months ago
- ☆50Updated 5 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆120Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆127Updated 4 months ago
- ☆152Updated 6 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆93Updated 2 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆115Updated 4 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization☆64Updated 5 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated last year
- ☆95Updated 7 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆122Updated 4 months ago
- LLM for Scientific Research Survey☆98Updated 6 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆109Updated 6 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆171Updated last week
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆62Updated 2 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆51Updated 2 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆61Updated 6 months ago