This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'
☆150May 30, 2025Updated 11 months ago
Alternatives and similar repositories for Avalon-LLM
Users that are interested in Avalon-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"☆18Aug 13, 2024Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆15Aug 12, 2024Updated last year
- ☆53Aug 24, 2025Updated 8 months ago
- ☆24Oct 13, 2024Updated last year
- The code used to power DeepRole☆37Nov 21, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Aug 22, 2025Updated 8 months ago
- ☆124Feb 21, 2025Updated last year
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments☆97Jan 26, 2026Updated 3 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆702Jan 20, 2025Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆64Mar 17, 2026Updated last month
- ☆29Oct 2, 2025Updated 7 months ago
- ☆47Jun 24, 2025Updated 10 months ago
- Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a…☆29Nov 9, 2025Updated 6 months ago
- ☆28Nov 10, 2025Updated 5 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆100Jun 12, 2024Updated last year
- Learning to Group Auxiliary Datasets for Molecule, NeurIPS2023☆18Dec 19, 2023Updated 2 years ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆3,399Feb 8, 2026Updated 3 months ago
- ☆21Jul 25, 2025Updated 9 months ago
- Computed Appraisals Model. Code and data for the 2023 paper, "Emotion prediction as computation over a generative theory of mind"☆13Jun 12, 2023Updated 2 years ago
- ☆14May 9, 2024Updated 2 years ago
- ☆27Feb 13, 2026Updated 2 months ago
- ☆12May 6, 2024Updated 2 years ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆242Feb 24, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆30Feb 15, 2024Updated 2 years ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 2 months ago
- ☆49Aug 6, 2024Updated last year
- Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)☆21Nov 10, 2025Updated 5 months ago
- ☆15Mar 26, 2024Updated 2 years ago
- ☆14Aug 21, 2025Updated 8 months ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated last year
- ☆16Feb 22, 2025Updated last year
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆102Jul 2, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆19Jan 16, 2025Updated last year
- ☆19Aug 22, 2025Updated 8 months ago
- ☆23Dec 17, 2024Updated last year
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆117Nov 4, 2025Updated 6 months ago
- Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"☆23Mar 30, 2024Updated 2 years ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆23Sep 25, 2025Updated 7 months ago
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 9 months ago