Zhang-Yihao / Adversarial-Representation-Engineering

Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.
17Updated 2 months ago

Alternatives and similar repositories for Adversarial-Representation-Engineering:

Users that are interested in Adversarial-Representation-Engineering are comparing it to the libraries listed below