Jellyfish042 / uncheatable_eval

Evaluating LLMs with Dynamic Data
75Updated last week

Alternatives and similar repositories for uncheatable_eval:

Users that are interested in uncheatable_eval are comparing it to the libraries listed below