qhjqhj00 / MetaAgentLinks
MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning
☆41Updated 5 months ago
Alternatives and similar repositories for MetaAgent
Users that are interested in MetaAgent are comparing it to the libraries listed below
Sorting:
- ☆87Updated 5 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Updated 7 months ago
- Agentic Learning Powered by AWorld☆88Updated this week
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Updated 6 months ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆137Updated 5 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆66Updated 2 months ago
- ☆46Updated 8 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆118Updated 4 months ago
- ☆19Updated 11 months ago
- Scaling Preference Data Curation via Human-AI Synergy☆141Updated 7 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆63Updated 8 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆93Updated 8 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆69Updated 9 months ago
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆75Updated 3 months ago
- ☆49Updated 5 months ago
- ☆23Updated last year
- ☆100Updated 6 months ago
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆101Updated 4 months ago
- The code and data for the paper JiuZhang3.0☆49Updated last year
- ☆43Updated 5 months ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆45Updated 7 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆96Updated 10 months ago
- ☆71Updated last year
- ☆21Updated 2 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding☆112Updated 2 weeks ago
- ☆16Updated last year
- ☆16Updated last year
- ☆39Updated 6 months ago
- Long Context Research☆26Updated 2 weeks ago