interpretingdl/eacl2024_transformer_interpretability_tutorial

Materials for EACL2024 tutorial: Transformer-specific Interpretability

☆66

Alternatives and similar repositories for eacl2024_transformer_interpretability_tutorial

Users interested in eacl2024_transformer_interpretability_tutorial are comparing it to the repositories listed below.

redwoodresearch / Easy-Transformer
☆148 · Aug 4, 2024 · Updated last year
kanishkamisra / wugs-and-daxes
Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study th…
☆22 · Mar 20, 2024 · Updated 2 years ago
jannik-brinkmann / multilingual-features
Code for the paper "Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages" (N…
☆17 · Apr 13, 2025 · Updated last year
EleutherAI / attribute
☆16 · Nov 14, 2025 · Updated 8 months ago
MaheepChaudhary / SAE-Ravel
Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…
☆13 · Jan 26, 2025 · Updated last year
ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models
This repository collects all relevant resources about interpretability in LLMs
☆402 · Nov 1, 2024 · Updated last year
allenai / beaker-gantry
Gantry provides an API that streamlines running experiments in Beaker
☆31 · Updated this week
inseq-team / inseq
Interpretability for sequence generation models 🐛 🔍
☆471 · Apr 25, 2026 · Updated 2 months ago
aaronmueller / MIB
Landing page for MIB: A Mechanistic Interpretability Benchmark
☆26 · Aug 15, 2025 · Updated 11 months ago
UKPLab / tmlr2026-manifold-analysis
☆21 · Mar 3, 2026 · Updated 4 months ago
saprmarks / feature-circuits
☆223 · Oct 14, 2025 · Updated 9 months ago
evandez / relations
How do transformer LMs encode relations?
☆59 · Feb 24, 2024 · Updated 2 years ago
samyadeepbasu / LocoGen
Localization of Knowledge in Text-to-Image Models
☆11 · Oct 8, 2024 · Updated last year
Heidelberg-NLP / CC-SHAP-VLM
Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…
☆12 · Jul 14, 2026 · Updated last week
hannamw / EAP-IG
☆83 · May 23, 2026 · Updated 2 months ago
Aaquib111 / edge-attribution-patching
Code for my NeurIPS 2023 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"
☆48 · May 31, 2024 · Updated 2 years ago
meanna / ThaiLMCUT
☆18 · May 6, 2022 · Updated 4 years ago
cpllab / syntactic-generalization
Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"
☆31 · Jun 18, 2021 · Updated 5 years ago
Garrafao / MetaphoricChange
Data and code for the experiments in: "German in Flux: Detecting Metaphoric Change via Word Entropy". Dominik Schlechtweg, Stefanie Eckma…
☆10 · Aug 26, 2019 · Updated 6 years ago
Dakingrai / awesome-mechanistic-interpretability-lm-papers
☆260 · Nov 22, 2024 · Updated last year
viking-sudo-rm / industrial-stacknns
Stack neural networks applied to hefty natural language tasks.
☆15 · Dec 26, 2019 · Updated 6 years ago
o-laurent / multivariate-ks-test
Python implementation of an extension of the Kolmogorov-Smirnov test for multivariate samples
☆13 · Aug 6, 2023 · Updated 2 years ago
UKPLab / emnlp2018-novel-metaphors
Annotations and code for the EMNLP 2018 paper 'Weeding out Conventionalized Metaphors: A Corpus of Novel Metaphor Annotations'
☆10 · Feb 20, 2023 · Updated 3 years ago
google-deepmind / mishax
☆156 · Updated this week
stanfordnlp / pyvene
Stanford NLP Python library for understanding and improving PyTorch models via interventions
☆892 · Mar 6, 2026 · Updated 4 months ago
ViCCo-Group / semantic_features_gpt_3
Code and data from semantic feature generation with GPT-3
☆17 · Sep 10, 2023 · Updated 2 years ago
chorowski-lab / CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆10 · Feb 22, 2022 · Updated 4 years ago
yuri-bizzoni / Metaphor-Paraphrase
☆14 · Jul 31, 2022 · Updated 3 years ago
TransformerLensOrg / TransformerLens
A library for mechanistic interpretability of GPT-style language models
☆3,710 · Updated this week
mingfengwan / mdbootstrap-academic
Elegant Material Design template for academic portfolios. 100/100 performance score.
☆13 · Jul 17, 2026 · Updated last week
Fraunhofer-AISEC / towards-resistant-audio-adversarial-examples
Generation tool for offset-resistant audio adversarial examples against Deepspeech
☆10 · Oct 5, 2020 · Updated 5 years ago
acl-org / ethics-reading-list
A list of ethics related resources for researchers and practitioners of Natural Language Processing and Computational Linguistics
☆34 · Oct 20, 2025 · Updated 9 months ago
FarnoushRJ / RelP
[NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La…
☆29 · Nov 3, 2025 · Updated 8 months ago
UFO-101 / auto-circuit
A library for efficient patching and automatic circuit discovery.
☆99 · Dec 31, 2025 · Updated 6 months ago
technion-cs-nlp / irm-for-nli
☆11 · Jun 2, 2022 · Updated 4 years ago
INK-USC / PE2
Code for paper "Prompt Engineering a Prompt Engineer" (https://arxiv.org/abs/2311.05661)
☆12 · Aug 1, 2024 · Updated last year
NeuroLIAA / visions
Visual Search in Natural Scenes benchmark
☆20 · Sep 19, 2024 · Updated last year
zepingyu0512 / awesome-LLM-neuron
☆36 · Jun 13, 2025 · Updated last year
wenhycs / EMNLP2021-Utilizing-Relative-Event-Time-to-Enhance-Event-Event-Temporal-Relation-Extraction
☆12 · Oct 4, 2021 · Updated 4 years ago