meg-tong / sycophancy-eval

datasets from the paper "Towards Understanding Sycophancy in Language Models"
63Updated last year

Related projects

Alternatives and complementary repositories for sycophancy-eval