yanweiyue / AgentPrune
☆49Updated 3 weeks ago
Alternatives and similar repositories for AgentPrune:
Users that are interested in AgentPrune are comparing it to the libraries listed below
- ☆30Updated 4 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆114Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆74Updated 2 months ago
- ☆55Updated 6 months ago
- ☆51Updated 2 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆136Updated 11 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆57Updated 6 months ago
- Code of paper: Multi-agent Architecture Search via Agentic Supernet☆43Updated 3 weeks ago
- ☆42Updated 5 months ago
- ☆19Updated 2 weeks ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆27Updated 3 weeks ago
- ☆107Updated 2 weeks ago
- A research repo for experiments about Reinforcement Finetuning☆43Updated 2 weeks ago
- [NeurIPS 2024] Official implementation for paper "Can Graph Learning Improve Planning in LLM-based Agents?"☆120Updated 4 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆49Updated 5 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆75Updated 3 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆111Updated 7 months ago
- ☆91Updated last month
- Awesome Agent Training☆33Updated this week
- ☆30Updated last month
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆72Updated 3 weeks ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆92Updated last month
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆56Updated 5 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 4 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆43Updated 3 months ago
- ☆88Updated 3 months ago
- Accepted LLM Papers in NeurIPS 2024☆35Updated 6 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆60Updated this week
- The code of RouterDC☆57Updated last week
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆87Updated 3 weeks ago