ZHZisZZ / emulated-disalignmentLinks

[ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
36Updated 10 months ago

Alternatives and similar repositories for emulated-disalignment

Users that are interested in emulated-disalignment are comparing it to the libraries listed below

Sorting: