CUHK-ARISE / GAMABench

Benchmarking LLMs' Gaming Ability in Multi-Agent Environments
33 · Updated this week

Related projects: