MASWorks / ML-AgentLinks
The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"
☆44Updated last month
Alternatives and similar repositories for ML-Agent
Users that are interested in ML-Agent are comparing it to the libraries listed below
Sorting:
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆210Updated 2 weeks ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆77Updated 3 weeks ago
- Awesome Agent Training☆208Updated this week
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆122Updated 4 months ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆225Updated last week
- ☆262Updated last week
- ☆174Updated 3 months ago
- Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.☆216Updated last month
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆156Updated 2 months ago
- The official code of “Agentic Reinforced Policy Optimization”, an agentic RL algorithm optimization.☆383Updated this week
- ☆309Updated 2 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆80Updated 2 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆147Updated 2 months ago
- ☆263Updated 2 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆548Updated 3 months ago
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.☆243Updated this week
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆81Updated 2 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆135Updated last month
- ☆159Updated 3 months ago
- ☆310Updated 2 months ago
- ☆152Updated 6 months ago
- ☆77Updated 2 months ago
- Test-time preferenece optimization (ICML 2025).☆158Updated 3 months ago
- ☆67Updated last month
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆161Updated last month
- Official repository for RAG-Gym☆112Updated 5 months ago
- Segment Policy Optimization: Improved Credit Assignment in Reinforcement Learning for LLMs☆27Updated 2 weeks ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆245Updated last week
- ☆58Updated last month
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling☆477Updated this week