microsoft / SmartPlay

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support future development of LLMs.
129Updated 10 months ago

Alternatives and similar repositories for SmartPlay:

Users that are interested in SmartPlay are comparing it to the libraries listed below