yuplin2333 / representation-space-jailbreak

Code repository for our paper "Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis" (https://arxiv.org/abs/2406.10794)
20 · Updated 10 months ago

Alternatives and similar repositories for representation-space-jailbreak

Users who are interested in representation-space-jailbreak are comparing it to the libraries listed below.
