google/werewolf_arena

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google/werewolf_arena)

google / werewolf_arena

☆48

Alternatives and similar repositories for werewolf_arena

Users that are interested in werewolf_arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

spinbench / spinbench
View on GitHub
☆28May 30, 2026Updated last month
nirgreshler / bayesian-online-planning
View on GitHub
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13Jun 17, 2024Updated 2 years ago
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago
ljcleo / agent_sense
View on GitHub
Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
☆13Jan 4, 2025Updated last year
bowen-upenn / Multi-Agent-VQA
View on GitHub
[CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering
☆22Sep 21, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PathPlanning / ManipulationPlanning-SI-RRT
View on GitHub
Combination of Rapidly-Exporing Random Trees (RRT) and Safe Interval Path Planning (SIPP) for high-DOF planning in dynamic environments,…
☆18May 17, 2026Updated last month
chenlong-clock / RULE-Unlearn
View on GitHub
[NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality
☆20Oct 22, 2025Updated 8 months ago
microsoft / tale-suite
View on GitHub
Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.
☆30May 9, 2026Updated 2 months ago
AIRI-Institute / eco4cast
View on GitHub
eco4cast library aims to reduce carbon footprint of machine learning models with predictive cloud computing scheduling
☆16Aug 26, 2024Updated last year
voxel51 / reconstruction-error-ratios
View on GitHub
Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!
☆28Jan 10, 2025Updated last year
RainBowLuoCS / MMEvol
View on GitHub
(ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"
☆21May 15, 2025Updated last year
Cognitive-AI-Systems / RATE
View on GitHub
☆16Sep 4, 2024Updated last year
StonyBrookNLP / PerSenT
View on GitHub
[COLING2020] A challenge dataset for Person SenTiment analysis in news domain.
☆11May 2, 2022Updated 4 years ago
zuucan / NeedleInAHaystack-PLUS
View on GitHub
To assess the longtext capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r…
☆13Mar 4, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ybwang119 / label_recovery
View on GitHub
[ICLR 2024] Towards Elminating Hard Label Constraints in Gradient Inverision Attacks
☆14Feb 6, 2024Updated 2 years ago
rhyang2021 / ARIA
View on GitHub
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆30Aug 9, 2025Updated 11 months ago
cog-model / NPField
View on GitHub
Neural Potential Field for Obstacle-Aware Local Motion Planning
☆23Jun 2, 2024Updated 2 years ago
yuddim / deepClassificationTool
View on GitHub
Deep image classification tool based on Keras. Tool implements light versions of VGG, ResNet and InceptionV3 for small images
☆16May 14, 2018Updated 8 years ago
TextArena / UnstableBaselines
View on GitHub
☆120Apr 7, 2026Updated 3 months ago
Mercidaiha / IRT-Router
View on GitHub
[ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"
☆34Feb 19, 2025Updated last year
maitrix-org / dynamic-alignment-optimization
View on GitHub
[EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…
☆24Nov 17, 2024Updated last year
kevinwortman / advanced-algorithms-slides
View on GitHub
my slides for an advanced algorithms course
☆15Apr 26, 2025Updated last year
kxfan2002 / Reagent
View on GitHub
Agent-RRM: Exploring Reasoning Reward Model for Agents
☆70Mar 17, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ZhaolinGao / REFUEL
View on GitHub
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
☆25Oct 8, 2024Updated last year
Gen-Verse / GenEnv
View on GitHub
GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators
☆62Dec 23, 2025Updated 6 months ago
thu-nics / MARSHAL
View on GitHub
[ICLR'26] MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
☆53Apr 17, 2026Updated 2 months ago
Cognitive-AI-Systems / WaRP
View on GitHub
WaRP dataset includes labeled images of an industrial conveyor.
☆19Dec 22, 2023Updated 2 years ago
Cognitive-AI-Systems / IGOR
View on GitHub
☆16Sep 5, 2024Updated last year
iglu-contest / iglu-2022-rl-baseline
View on GitHub
Modular and Hierachical RL baseline solution for the IGLU RL track @ NeurIPS 2022
☆20Sep 16, 2022Updated 3 years ago
x35f / model_based_rl
View on GitHub
model based reinforcement learning algorithms for unstable baselines
☆15May 9, 2023Updated 3 years ago
adamkarvonen / train_ChessGPT
View on GitHub
A repository for training nanogpt-based Chess playing language models.
☆30Apr 25, 2024Updated 2 years ago
Cognitive-AI-Systems / TransPath
View on GitHub
This repository contains a deep learning-based approach for improving A* search efficiency on grid graphs. By learning instance-dependent…
☆23Aug 27, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
3DAgentWorld / LLM-Game-Agent
View on GitHub
☆24Oct 13, 2024Updated last year
csf-ngs / illuminasavr
View on GitHub
Parses and Plots Illumina SAV files
☆13Jan 30, 2019Updated 7 years ago
ftgTUGraz / LLM4ADSTest
View on GitHub
[IEEE-TITS] Official implementation of paper "A Survey on the Application of Large Language Models in Scenario-Based Testing of Automated…
☆35Jun 10, 2026Updated 3 weeks ago
iwangjian / Color4Dial
View on GitHub
Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)
☆21Nov 10, 2025Updated 7 months ago
Avmb / DialogLLMScenic
View on GitHub
Dialogue-based generation of self-driving simulation scenarios using Large Language Models
☆14Oct 13, 2024Updated last year
benediktstroebl / agent-evals
View on GitHub
☆27May 28, 2025Updated last year
abdulhaim / LMRL-Gym
View on GitHub
☆116Jul 2, 2024Updated 2 years ago