wollschlager / geometry-of-refusalView on GitHub
Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence
26Jul 31, 2025Updated 7 months ago

Alternatives and similar repositories for geometry-of-refusal

Users that are interested in geometry-of-refusal are comparing it to the libraries listed below

Sorting:

Are these results useful?