BeyonderXX / ShadowAlignment

Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models
23Updated last year

Related projects

Alternatives and complementary repositories for ShadowAlignment