YiyiyiZhao / sirenLinks

Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models (LLMs). This repository contains the resources for reproducing the experiments described in our work.
11Updated 5 months ago

Alternatives and similar repositories for siren

Users that are interested in siren are comparing it to the libraries listed below

Sorting: