reds-lab / BEEAR
This is the official GitHub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models".
22 · Jul 3, 2024 · Updated last year

Alternatives and similar repositories for BEEAR

Users that are interested in BEEAR are comparing it to the libraries listed below.
