hendrycks / test

Measuring Massive Multitask Language Understanding | ICLR 2021
1,148 | Updated last year

Related projects: