PKU-Alignment / llms-resist-alignmentLinks
[ACL2025 Best Paper] Language Models Resist Alignment
☆34Updated 4 months ago
Alternatives and similar repositories for llms-resist-alignment
Users that are interested in llms-resist-alignment are comparing it to the libraries listed below
Sorting:
- [2025-TMLR] A Survey on the Honesty of Large Language Models