meg-tong / sycophancy-eval

datasets from the paper "Towards Understanding Sycophancy in Language Models"
59Updated 10 months ago

Related projects: