XuandongZhao / weak-to-strongLinks

[ICML 2025] Weak-to-Strong Jailbreaking on Large Language Models
76Updated last month

Alternatives and similar repositories for weak-to-strong

Users that are interested in weak-to-strong are comparing it to the libraries listed below

Sorting: