cooperleong00 / ToxificationReversal

Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)
15Updated last year

Related projects

Alternatives and complementary repositories for ToxificationReversal