microsoft / LLF-Bench

A benchmark for evaluating learning agents based on just language feedback
56Updated last month

Related projects

Alternatives and complementary repositories for LLF-Bench