YiyiyiZhao / siren

Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models (LLMs). This repository contains the resources for reproducing the experiments described in our work.
11 · Updated last month

Alternatives and similar repositories for siren

Users who are interested in siren compare it to the libraries listed below.
