thu-coai / AISafetyLab
AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.
☆160Updated this week
Alternatives and similar repositories for AISafetyLab
Users that are interested in AISafetyLab are comparing it to the libraries listed below
Sorting:
- [ICML 2025] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping".☆112Updated last week
- Improved techniques for optimization-based jailbreaking on large language models (ICLR2025)