ayyucekizrak / Mechanistic-Interpretability

Mechanistic Interpretability in Transformers: This repository explores advanced techniques like Induction Head Detection and QK Circuit Analysis to uncover the inner workings of transformer-based models.
18Updated 3 months ago

Alternatives and similar repositories for Mechanistic-Interpretability:

Users that are interested in Mechanistic-Interpretability are comparing it to the libraries listed below