research-outcome / LLM-Game-Benchmark
Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard
25 · Dec 14, 2024 · Updated last year

Alternatives and similar repositories for LLM-Game-Benchmark

Users that are interested in LLM-Game-Benchmark are comparing it to the libraries listed below.
