research-outcome / LLM-Game-BenchmarkLinks

Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard
21Updated last year

Alternatives and similar repositories for LLM-Game-Benchmark

Users that are interested in LLM-Game-Benchmark are comparing it to the libraries listed below

Sorting: