Fu-Dayuan / AgentRefineLinks
(ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning
☆19Updated 2 months ago
Alternatives and similar repositories for AgentRefine
Users that are interested in AgentRefine are comparing it to the libraries listed below
Sorting:
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆41Updated 5 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆69Updated 8 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆96Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Updated 7 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆118Updated 4 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆84Updated 3 months ago
- ☆104Updated last year
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆130Updated 10 months ago
- ☆87Updated 5 months ago
- Agentic Learning Powered by AWorld☆86Updated last week
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆64Updated 2 months ago
- PGRAG☆52Updated last year
- A Comprehensive Library for Memory of LLM-based Agents.☆100Updated 8 months ago
- LLM-in-Sandbox Elicits General Agentic Intelligence☆167Updated 2 weeks ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 8 months ago
- ☆84Updated last year
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".☆36Updated 5 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆306Updated 3 months ago
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆38Updated last year
- ☆46Updated 7 months ago
- ☆43Updated 5 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated last year
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆63Updated 6 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆62Updated 7 months ago
- ☆58Updated last year
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Updated last month
- DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.☆127Updated 3 weeks ago
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆56Updated 7 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆95Updated 3 months ago