yueshengbin / SMART
[AAAI 2025 Oral] Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
☆29 Updated 9 months ago
Alternatives and similar repositories for SMART
Users who are interested in SMART are comparing it to the repositories listed below.
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language … ☆169 Updated 8 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆312 Updated 3 weeks ago
- ☆32 Updated 8 months ago
- ☆104 Updated 3 months ago
- ☆198 Updated last year
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet ☆237 Updated 2 months ago
- [ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models ☆108 Updated 8 months ago
- ☆192 Updated 3 months ago
- ☆70 Updated 7 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning" ☆153 Updated 3 months ago
- Reinforced Multi-LLM Agents training ☆69 Updated 2 weeks ago
- A curated list of personalized alignment resources (continually updated). ☆57 Updated 3 months ago
- The awesome agents in the era of large language models ☆71 Updated 2 years ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆411 Updated 2 months ago
- ☆182 Updated last week
- A comprehensive collection of process reward models. ☆135 Updated 3 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆62 Updated 4 months ago
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments ☆94 Updated this week
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆155 Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆55 Updated last year
- [NeurIPS 2024] Agent Planning with World Knowledge Model ☆162 Updated last year
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization ☆192 Updated last year
- ☆20 Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆134 Updated 10 months ago
- ☆332 Updated 8 months ago
- Benchmarking Multi-Agent Debate between Language Models for Truthfulness in Q&A. ☆48 Updated last year
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents ☆290 Updated 2 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations ☆143 Updated 2 months ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo… ☆97 Updated 6 months ago
- ☆25 Updated 2 years ago