THU-KEG / SafetyNeuron
Data and code for the paper: Finding Safety Neurons in Large Language Models
21 | Jan 29, 2026 | Updated last month

Alternatives and similar repositories for SafetyNeuron

Users who are interested in SafetyNeuron are comparing it to the libraries listed below.
