sjtu-sai-agents / ML-MasterLinks
The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"
☆302Updated last week
Alternatives and similar repositories for ML-Master
Users that are interested in ML-Master are comparing it to the libraries listed below
Sorting:
- ☆789Updated 2 months ago
- Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.☆298Updated 2 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆510Updated 3 months ago
- ☆254Updated 4 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆844Updated 5 months ago
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆55Updated 6 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆521Updated last month
- Build, evaluate and train General Multi-Agent Assistance with ease☆1,082Updated last week
- DeepConf: Deep Think with Confidence☆338Updated 3 months ago
- SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasonin…☆209Updated 3 months ago
- ☆865Updated 4 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆679Updated 2 months ago
- AgentEvolver: Towards Efficient Self-Evolving Agent System☆989Updated last week
- CycleResearcher: Improving Automated Research via Automated Review☆317Updated 5 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆377Updated last week
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆522Updated last month
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆182Updated 5 months ago
- Repository for Zochi's Research☆297Updated last month
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 8 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆298Updated 2 months ago
- ☆207Updated 5 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆298Updated 2 months ago
- Awesome Deep Research list! For more details, please refer to our survey paper -- A Comprehensive Survey of Deep Research: Systems, Metho…☆385Updated 2 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation)☆512Updated 3 months ago
- ☆403Updated 2 months ago
- The official code of ARPO & AEPO☆839Updated this week
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆97Updated 5 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,116Updated last month
- [Up-to-date] Awesome Agentic Deep Research Resources☆586Updated 4 months ago
- ☆472Updated 2 months ago