THU-KEG / SafetyNeuronLinks

Data and code for the paper: Finding Safety Neurons in Large Language Models
20Updated this week

Alternatives and similar repositories for SafetyNeuron

Users that are interested in SafetyNeuron are comparing it to the libraries listed below

Sorting: