Jellyfish042 / uncheatable_eval

Evaluating LLMs with Dynamic Data
72Updated last week

Related projects

Alternatives and complementary repositories for uncheatable_eval