ayyucekizrak / Mechanistic-Interpretability

Mechanistic Interpretability in Transformers: This repository explores techniques such as induction head detection and QK circuit analysis to uncover the inner workings of transformer-based models.
13 · Updated last month

Related projects

Alternatives and complementary repositories for Mechanistic-Interpretability