Jiaxin-Wen / MisleadLMLinks

Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF""
18Updated last year

Alternatives and similar repositories for MisleadLM

Users that are interested in MisleadLM are comparing it to the libraries listed below

Sorting: