sjtu-sai-agents / X-MasterLinks
Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.
☆246Updated last week
Alternatives and similar repositories for X-Master
Users that are interested in X-Master are comparing it to the libraries listed below
Sorting:
- ☆388Updated 3 weeks ago
- CycleResearcher: Improving Automated Research via Automated Review☆227Updated last month
- Awesome Agent Training☆215Updated 3 weeks ago
- ☆318Updated 2 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆568Updated 4 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆204Updated 4 months ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆83Updated last month
- ☆191Updated 2 weeks ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆305Updated last week
- ☆361Updated 2 weeks ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆313Updated this week
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆44Updated 2 months ago
- Repository for Zochi's Research☆260Updated this week
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling☆480Updated 3 weeks ago
- An Awesome List of Agentic Model trained with Reinforcement Learning☆420Updated this week
- ☆274Updated 3 months ago
- ☆312Updated last month
- ✨ Agentic Reinforced Policy Optimization☆512Updated last week
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆125Updated 5 months ago
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.☆274Updated this week
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆243Updated 2 weeks ago
- ☆81Updated 3 months ago
- ☆197Updated this week
- ☆157Updated 7 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆280Updated 3 months ago
- ☆161Updated 3 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆625Updated 3 weeks ago
- MiroFlow is an agent framework that simplifies the development of complex, multi-agent systems. Build, manage, and scale your AI agents w…☆384Updated this week
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆82Updated 3 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆298Updated this week