XuandongZhao / weak-to-strong

Weak-to-Strong Jailbreaking on Large Language Models
73Updated 10 months ago

Alternatives and similar repositories for weak-to-strong:

Users that are interested in weak-to-strong are comparing it to the libraries listed below