ZHZisZZ / emulated-disalignment

[ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
33Updated 5 months ago

Alternatives and similar repositories for emulated-disalignment:

Users that are interested in emulated-disalignment are comparing it to the libraries listed below