qhjqhj00 / MetaAgentLinks
MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning
☆40Updated 3 months ago
Alternatives and similar repositories for MetaAgent
Users that are interested in MetaAgent are comparing it to the libraries listed below
Sorting:
- ☆86Updated 4 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 5 months ago
- ☆93Updated 7 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- Efficient Agent Training for Computer Use☆134Updated 3 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Updated 4 months ago
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆61Updated last month
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆106Updated 2 months ago
- ☆45Updated 4 months ago
- ☆98Updated 4 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆67Updated 7 months ago
- ☆46Updated 6 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆60Updated 6 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆52Updated last week
- Agentic Learning Powered by AWorld☆57Updated this week
- ☆60Updated last year
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆143Updated 3 months ago
- Scaling Preference Data Curation via Human-AI Synergy☆132Updated 5 months ago
- ☆38Updated 4 months ago
- ☆95Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆78Updated last year
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆44Updated 5 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆45Updated 3 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆214Updated 2 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆95Updated 8 months ago
- ☆23Updated last year
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆154Updated 5 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆67Updated 8 months ago
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆89Updated 2 months ago
- ☆53Updated 10 months ago