XuandongZhao / weak-to-strong

Weak-to-Strong Jailbreaking on Large Language Models
67 · Updated 9 months ago

Related projects

Alternatives and complementary repositories for weak-to-strong