DalasNoin / redteaming

Red-teaming a simple language model such as GPT-2, based on the Anthropic red-teaming paper.
9 · Updated 2 years ago

Alternatives and similar repositories for redteaming:

Users who are interested in redteaming compare it to the libraries listed below.