fiveai / understanding_safety_finetuningView on GitHub
Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)
12Oct 31, 2024Updated last year

Alternatives and similar repositories for understanding_safety_finetuning

Users that are interested in understanding_safety_finetuning are comparing it to the libraries listed below

Sorting:

Are these results useful?