XuandongZhao / weak-to-strong

[ICML 2025] Weak-to-Strong Jailbreaking on Large Language Models
74 · Updated this week

Alternatives and similar repositories for weak-to-strong:

Users who are interested in weak-to-strong are comparing it to the libraries listed below.