sjtu-sai-agents / X-MasterLinks
Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.
☆206Updated last month
Alternatives and similar repositories for X-Master
Users that are interested in X-Master are comparing it to the libraries listed below
Sorting:
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆44Updated last month
- ☆300Updated last month
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆77Updated 3 weeks ago
- Awesome Agent Training☆204Updated 2 weeks ago
- ☆262Updated last week
- ☆157Updated 3 months ago
- ☆76Updated 2 months ago
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling☆470Updated 2 weeks ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆541Updated 3 months ago
- ☆761Updated 2 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆122Updated 4 months ago
- ☆174Updated 3 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆200Updated 2 weeks ago
- Repository for Zochi's Research☆248Updated last month
- ☆161Updated 2 weeks ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆142Updated last month
- CycleResearcher: Improving Automated Research via Automated Review☆219Updated 3 weeks ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆198Updated 3 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆80Updated 2 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆588Updated last week
- ☆152Updated 6 months ago
- ☆300Updated 2 months ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆225Updated last week
- The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"☆116Updated last month
- Test-time preferenece optimization (ICML 2025).☆155Updated 3 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆151Updated 3 weeks ago
- Awesome Deep Research list☆275Updated last month
- ☆286Updated last week
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆154Updated last month
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆146Updated 3 weeks ago