listen0425 / Safety-Layers
Code for the paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)
22 · Apr 26, 2025 · Updated 10 months ago

Alternatives and similar repositories for Safety-Layers

Users who are interested in Safety-Layers are comparing it to the libraries listed below.
