This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'
☆153May 30, 2025Updated last year
Alternatives and similar repositories for Avalon-LLM
Users that are interested in Avalon-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"☆18Aug 13, 2024Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆15Aug 12, 2024Updated last year
- ☆53Aug 24, 2025Updated 9 months ago
- ☆24Oct 13, 2024Updated last year
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments☆97Jan 26, 2026Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- DataSciBench: An LLM Agent Benchmark for Data Science (Findings of ACL 2026)☆62Jan 21, 2026Updated 4 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆706Jan 20, 2025Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆69Mar 17, 2026Updated 3 months ago
- ☆32Oct 2, 2025Updated 8 months ago
- Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a…☆32Nov 9, 2025Updated 7 months ago
- ☆28Jun 2, 2026Updated 2 weeks ago
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…☆13Oct 27, 2024Updated last year
- ☆102Jun 12, 2024Updated 2 years ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆3,492Feb 8, 2026Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆21Jul 25, 2025Updated 10 months ago
- Computed Appraisals Model. Code and data for the 2023 paper, "Emotion prediction as computation over a generative theory of mind"☆13Jun 12, 2023Updated 3 years ago
- [WWW 2019] code for "Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification" https://arxiv.org/abs/1806.02557☆32Dec 1, 2019Updated 6 years ago
- ☆27May 30, 2026Updated 2 weeks ago
- ☆12May 6, 2024Updated 2 years ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆244Feb 24, 2025Updated last year
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆30Feb 15, 2024Updated 2 years ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 3 months ago
- ☆49Aug 6, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)☆21Nov 10, 2025Updated 7 months ago
- ☆47Oct 22, 2024Updated last year
- ☆15Mar 26, 2024Updated 2 years ago
- ☆14Aug 21, 2025Updated 9 months ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated last year
- ☆17Feb 22, 2025Updated last year
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆107Jul 2, 2024Updated last year
- [ NeurIPS 2023 ] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"☆20Oct 19, 2023Updated 2 years ago
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆18Jan 16, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆23Dec 17, 2024Updated last year
- Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"☆23Mar 30, 2024Updated 2 years ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆121Nov 4, 2025Updated 7 months ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆25Sep 25, 2025Updated 8 months ago
- Natural Language Reinforcement Learning☆101Jul 30, 2025Updated 10 months ago
- Sotopia-RL: Reward Design for Social Intelligence☆51Apr 1, 2026Updated 2 months ago
- Text-based game of lies and deceit, made for language models.☆32Aug 25, 2023Updated 2 years ago