XuandongZhao / weak-to-strong
View external linksLinks

[ICML 2025] Weak-to-Strong Jailbreaking on Large Language Models
90May 2, 2025Updated 9 months ago

Alternatives and similar repositories for weak-to-strong

Users that are interested in weak-to-strong are comparing it to the libraries listed below

Sorting:

Are these results useful?