Just-A-Pie / MiniAgentStudioLinks
☆21Updated 3 weeks ago
Alternatives and similar repositories for MiniAgentStudio
Users that are interested in MiniAgentStudio are comparing it to the libraries listed below
Sorting:
- Awesome papers for role-playing with language models☆205Updated 11 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆250Updated this week
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆441Updated 8 months ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆133Updated 3 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆137Updated 2 months ago
- In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a…☆61Updated 6 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆326Updated last year
- A Survey on Large Language Model-Based Game Agents☆725Updated 2 weeks ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆384Updated 8 months ago
- ☆24Updated last year
- ☆956Updated 3 months ago
- ☆265Updated 4 months ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆88Updated 5 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆671Updated 8 months ago
- A version of verl to support diverse tool use☆583Updated this week
- An Awesome List of Agentic Model trained with Reinforcement Learning☆489Updated 3 weeks ago
- Building a comprehensive and handy list of papers for GUI agents☆518Updated 3 weeks ago
- AN O1 REPLICATION FOR CODING☆335Updated 10 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆247Updated last month
- A series of technical report on Slow Thinking with LLM☆739Updated 2 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆338Updated 3 months ago
- ☆342Updated 4 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆206Updated 5 months ago
- ☆549Updated 9 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆611Updated last month
- RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models☆506Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆193Updated 5 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆258Updated last year
- ☆211Updated 7 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆289Updated this week