BeyonderXX / ShadowAlignment
Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models
34 · Oct 19, 2023 · Updated 2 years ago

Alternatives and similar repositories for ShadowAlignment

Users who are interested in ShadowAlignment are comparing it to the libraries listed below.
