This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'
☆142May 30, 2025Updated 9 months ago
Alternatives and similar repositories for Avalon-LLM
Users that are interested in Avalon-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"☆18Aug 13, 2024Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆14Aug 12, 2024Updated last year
- ☆53Aug 24, 2025Updated 7 months ago
- The code used to power DeepRole☆37Nov 21, 2022Updated 3 years ago
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments☆95Jan 26, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆124Feb 21, 2025Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆55Mar 17, 2026Updated last week
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆695Jan 20, 2025Updated last year
- ☆28Oct 2, 2025Updated 5 months ago
- ☆46Jun 24, 2025Updated 9 months ago
- ☆28Nov 10, 2025Updated 4 months ago
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…☆13Oct 27, 2024Updated last year
- Codebase for the ACL 2023 paper: White-Box Multi-Objective Adversarial Attack on Dialogue Generation.☆16Dec 8, 2023Updated 2 years ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆3,253Feb 8, 2026Updated last month
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆12May 6, 2024Updated last year
- ☆21Jul 25, 2025Updated 8 months ago
- ☆28Feb 13, 2026Updated last month
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆238Feb 24, 2025Updated last year
- ☆70Mar 22, 2024Updated 2 years ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆21Jun 13, 2025Updated 9 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated last month
- Using conversational games to evaluate powerful LLMs☆18Sep 3, 2023Updated 2 years ago
- ☆46Oct 22, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)☆21Nov 10, 2025Updated 4 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆92Jul 2, 2024Updated last year
- ☆15Mar 26, 2024Updated 2 years ago
- ☆14Aug 21, 2025Updated 7 months ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated last year
- ☆16Feb 22, 2025Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆47Jan 29, 2026Updated last month
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆19Jan 16, 2025Updated last year
- ☆19Aug 22, 2025Updated 7 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ NeurIPS 2023 ] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"☆20Oct 19, 2023Updated 2 years ago
- ☆23Dec 17, 2024Updated last year
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆108Nov 4, 2025Updated 4 months ago
- Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"☆23Mar 30, 2024Updated last year
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆22Sep 25, 2025Updated 6 months ago
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 7 months ago
- Text-based game of lies and deceit, made for language models.☆32Aug 25, 2023Updated 2 years ago