ayyucekizrak / Mechanistic-Interpretability

Mechanistic Interpretability in Transformers: This repository explores advanced techniques like Induction Head Detection and QK Circuit Analysis to uncover the inner workings of transformer-based models.
22 · Updated 8 months ago

Alternatives and similar repositories for Mechanistic-Interpretability

Users who are interested in Mechanistic-Interpretability compare it to the libraries listed below.
