qhjqhj00 / MetaAgentLinks
MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning
☆41Updated 4 months ago
Alternatives and similar repositories for MetaAgent
Users that are interested in MetaAgent are comparing it to the libraries listed below
Sorting:
- ☆87Updated 5 months ago
- ☆46Updated 7 months ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆135Updated 4 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆43Updated 11 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 6 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Updated 5 months ago
- ☆53Updated 11 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- Agentic Learning Powered by AWorld☆80Updated last week
- ☆23Updated last year
- Scaling Preference Data Curation via Human-AI Synergy☆137Updated 6 months ago
- ☆92Updated 8 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆60Updated 7 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆95Updated 9 months ago
- ☆100Updated 5 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆114Updated 3 months ago
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆63Updated 5 months ago
- ☆19Updated 10 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆68Updated 8 months ago
- ☆62Updated last year
- ☆50Updated 7 months ago
- ☆16Updated last year
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆150Updated 4 months ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆35Updated 7 months ago
- ☆47Updated 3 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Updated 9 months ago
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆72Updated 2 months ago
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆32Updated 5 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding☆101Updated this week
- ☆96Updated last year