sjtu-sai-agents / X-MasterLinks
Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.
☆298Updated 2 months ago
Alternatives and similar repositories for X-Master
Users that are interested in X-Master are comparing it to the libraries listed below
Sorting:
- The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"☆302Updated last week
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆511Updated 3 months ago
- CycleResearcher: Improving Automated Research via Automated Review☆319Updated 5 months ago
- ☆789Updated 2 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 8 months ago
- ☆254Updated 4 months ago
- A collection of resources and papers on AI Scientist / Robot Scientist☆117Updated 3 months ago
- ☆176Updated 2 months ago
- ☆404Updated 2 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆300Updated 2 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆680Updated 2 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆259Updated last month
- Repository for Zochi's Research☆297Updated last month
- ☆480Updated 2 months ago
- The official code of ARPO & AEPO☆839Updated last week
- Now, Stronger AI Pushes Frontiers, Stronger Our Shared Future.☆270Updated last month
- ☆325Updated 7 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆231Updated last month
- ☆206Updated 5 months ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆97Updated 5 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆380Updated last week
- ☆207Updated 5 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆531Updated last month
- [NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation)☆513Updated 3 months ago
- Towards a Unified View of Large Language Model Post-Training☆198Updated 3 months ago
- [ACL 2025 Main] Multi-Agent System for Science of Science☆119Updated 5 months ago
- Paper list of agent for science☆180Updated 3 weeks ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 7 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆166Updated 2 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆384Updated last month