This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'
☆144May 30, 2025Updated 10 months ago
Alternatives and similar repositories for Avalon-LLM
Users that are interested in Avalon-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"☆18Aug 13, 2024Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆14Aug 12, 2024Updated last year
- The code used to power DeepRole☆37Nov 21, 2022Updated 3 years ago
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments☆97Jan 26, 2026Updated 2 months ago
- ☆124Feb 21, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- DataSciBench: An LLM Agent Benchmark for Data Science☆55Jan 21, 2026Updated 2 months ago
- ☆24Oct 13, 2024Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆60Mar 17, 2026Updated 3 weeks ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆701Jan 20, 2025Updated last year
- ☆28Oct 2, 2025Updated 6 months ago
- ☆46Jun 24, 2025Updated 9 months ago
- Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a…☆28Nov 9, 2025Updated 5 months ago
- ☆28Nov 10, 2025Updated 5 months ago
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…☆13Oct 27, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Codebase for the ACL 2023 paper: White-Box Multi-Objective Adversarial Attack on Dialogue Generation.☆16Dec 8, 2023Updated 2 years ago
- ☆100Jun 12, 2024Updated last year
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆3,316Feb 8, 2026Updated 2 months ago
- ☆21Jul 25, 2025Updated 8 months ago
- ☆14May 9, 2024Updated last year
- [WWW 2019] code for "Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification" https://arxiv.org/abs/1806.02557☆32Dec 1, 2019Updated 6 years ago
- ☆27Feb 13, 2026Updated 2 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆30Feb 15, 2024Updated 2 years ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆22Jun 13, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated last month
- ☆49Aug 6, 2024Updated last year
- ☆46Oct 22, 2024Updated last year
- Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)☆21Nov 10, 2025Updated 5 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆93Jul 2, 2024Updated last year
- ☆15Mar 26, 2024Updated 2 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated last year
- ☆16Feb 22, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆19Aug 22, 2025Updated 7 months ago
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆19Jan 16, 2025Updated last year
- ☆23Dec 17, 2024Updated last year
- Harmonizing N8N, NocoDB, One-API, and Fastchat to forge an accessible and intuitive AI flows integration platform. ⚡ 融合N8N、NocoDB、One-AP…☆11Jan 5, 2024Updated 2 years ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆114Nov 4, 2025Updated 5 months ago
- Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"☆23Mar 30, 2024Updated 2 years ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆23Sep 25, 2025Updated 6 months ago