MASWorks / ML-AgentLinks
The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"
☆56Updated 7 months ago
Alternatives and similar repositories for ML-Agent
Users that are interested in ML-Agent are comparing it to the libraries listed below
Sorting:
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆165Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆131Updated 10 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆303Updated 3 months ago
- ☆255Updated 5 months ago
- Official implementation of MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems☆72Updated 6 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆306Updated 2 weeks ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆276Updated last month
- ☆186Updated 3 months ago
- ☆327Updated 7 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆524Updated 4 months ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆97Updated 6 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆142Updated 11 months ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆168Updated 2 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆94Updated 2 months ago
- ☆179Updated 8 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆130Updated 2 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆234Updated 2 months ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 7 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆82Updated 2 months ago
- [COLM2025] "Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors"☆54Updated 3 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆111Updated 3 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆398Updated 3 weeks ago
- ☆212Updated 5 months ago
- The official implementation of "ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning"☆329Updated last week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆689Updated 3 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆68Updated 8 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆113Updated 3 weeks ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆141Updated last year
- ☆97Updated 9 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆161Updated last month