sjtu-sai-agents / ML-MasterLinks
The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"
☆116Updated last month
Alternatives and similar repositories for ML-Master
Users that are interested in ML-Master are comparing it to the libraries listed below
Sorting:
- Awesome Agent Training☆208Updated this week
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆135Updated last month
- ☆309Updated 2 months ago
- ☆152Updated 6 months ago
- Awesome Deep Research list☆275Updated last month
- ☆262Updated last week
- Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.☆216Updated last month
- CycleResearcher: Improving Automated Research via Automated Review☆220Updated last month
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆548Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆122Updated 4 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease☆506Updated this week
- ☆310Updated 2 months ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆225Updated last week
- A High-Efficiency System of Large Language Model Based Search Agents☆71Updated last month
- Agentic RAG R1 Framework via Reinforcement Learning☆273Updated 2 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆275Updated 2 months ago
- ☆286Updated 2 weeks ago
- ☆88Updated this week
- AN O1 REPLICATION FOR CODING☆336Updated 8 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆588Updated last week
- ☆174Updated 3 months ago
- ☆161Updated 2 weeks ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models☆147Updated 2 months ago
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆44Updated last month
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆77Updated 3 weeks ago
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆268Updated 5 months ago
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling☆477Updated this week
- ☆77Updated 2 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆210Updated 2 weeks ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆101Updated last week