MikaStars39 / FeatureAlignmentLinks

FeatureAlignment = Alignment + Mechanistic Interpretability
28Updated 3 months ago

Alternatives and similar repositories for FeatureAlignment

Users that are interested in FeatureAlignment are comparing it to the libraries listed below

Sorting: