sjtu-sai-agents / ML-Master
The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"
☆154 Updated 3 weeks ago
Alternatives and similar repositories for ML-Master
Users who are interested in ML-Master are comparing it to the repositories listed below.
- ☆407 Updated 3 weeks ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow. ☆625 Updated last month
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆305 Updated last week
- Awesome Agent Training ☆215 Updated 3 weeks ago
- ☆198 Updated 2 weeks ago
- Official implementation of X-Master, a general-purpose tool-augmented reasoning agent. ☆246 Updated last week
- AN O1 REPLICATION FOR CODING ☆335 Updated 8 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease ☆619 Updated this week
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆242 Updated this week
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents ☆337 Updated 3 weeks ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the… ☆147 Updated last month
- MiroFlow is an agent framework that simplifies the development of complex, multi-agent systems. Build, manage, and scale your AI agents w… ☆384 Updated this week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆568 Updated 4 months ago
- Awesome Deep Research list ☆302 Updated 2 months ago
- ☆318 Updated 2 months ago
- ☆175 Updated last month
- CycleResearcher: Improving Automated Research via Automated Review ☆227 Updated last month
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆156 Updated 2 months ago
- Repository for Zochi's Research ☆260 Updated last week
- ☆136 Updated last week
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios. ☆274 Updated this week
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso… ☆125 Updated 5 months ago
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling". ☆270 Updated 6 months ago
- ☆158 Updated 7 months ago
- ☆361 Updated 2 weeks ago
- ☆791 Updated 2 months ago
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling ☆480 Updated 3 weeks ago
- A High-Efficiency System of Large Language Model Based Search Agents ☆72 Updated last month
- ☆240 Updated 8 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414 ☆327 Updated this week