XuandongZhao / weak-to-strongView on GitHub
[ICML 2025] Weak-to-Strong Jailbreaking on Large Language Models
91May 2, 2025Updated 10 months ago

Alternatives and similar repositories for weak-to-strong

Users that are interested in weak-to-strong are comparing it to the libraries listed below

Sorting:

Are these results useful?