bingreeky / MaASLinks
[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet
β178Updated 3 months ago
Alternatives and similar repositories for MaAS
Users that are interested in MaAS are comparing it to the libraries listed below
Sorting:
- β82Updated 5 months ago
- π₯π₯π₯ ICLR 2025 Oral. Automating Agentic Workflow Generation.β259Updated last month
- A Framework for LLM-based Multi-Agent Reinforced Training and Inferenceβ246Updated last month
- β184Updated last month
- β86Updated 3 months ago
- β205Updated last month
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimizationβ168Updated last year
- [ICLR 2025] Benchmarking Agentic Workflow Generationβ128Updated 7 months ago
- Awesome Agent Trainingβ225Updated 2 weeks ago
- β287Updated 3 months ago
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaborationβ136Updated 2 weeks ago
- β40Updated 5 months ago
- β58Updated 9 months ago
- π§Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learningβ253Updated 2 weeks ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language β¦β112Updated 4 months ago
- β472Updated 2 weeks ago
- β67Updated 3 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineeringβ206Updated 4 months ago
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"β106Updated 3 weeks ago
- β393Updated 2 weeks ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.β127Updated 6 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruningβ87Updated 7 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"β79Updated 9 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reasoβ¦β127Updated 6 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ244Updated 4 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".β81Updated 3 months ago
- β108Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learningβ253Updated 4 months ago
- Test-time preferenece optimization (ICML 2025).β165Updated 4 months ago
- [COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?β34Updated 3 months ago