jonathanmli / Avalon-LLM
This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'
☆101Updated last month
Alternatives and similar repositories for Avalon-LLM:
Users that are interested in Avalon-LLM are comparing it to the libraries listed below
- ☆141Updated 10 months ago
- ☆103Updated 2 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆114Updated 10 months ago
- A banchmark list for evaluation of large language models.☆91Updated 2 weeks ago
- augmented LLM with self reflection☆117Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆293Updated 10 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆78Updated 3 weeks ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆106Updated last year
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆52Updated 4 months ago
- Reasoning with Language Model is Planning with World Model☆162Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆130Updated 4 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆130Updated 4 months ago
- Natural Language Reinforcement Learning☆82Updated 3 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆153Updated 11 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆94Updated last year
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆121Updated 3 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆44Updated last year
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆215Updated 2 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆135Updated 10 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆83Updated last week
- ☆102Updated 3 months ago
- ☆128Updated last week
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆108Updated 3 weeks ago
- FireAct: Toward Language Agent Fine-tuning☆271Updated last year
- ☆44Updated 3 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆165Updated this week
- Toy implementation of Strawberry☆31Updated 6 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆86Updated 5 months ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆58Updated 6 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆46Updated 4 months ago