kayoyin/interpret-lm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kayoyin/interpret-lm)

kayoyin / interpret-lm

Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)

☆63

Alternatives and similar repositories for interpret-lm

Users that are interested in interpret-lm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xiye17 / EvalQAExpl
View on GitHub
Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.
☆17Apr 25, 2021Updated 5 years ago
mohsenfayyaz / GlobEnc
View on GitHub
[NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
☆21May 16, 2023Updated 3 years ago
LanD-FBK / benchmark-gen-explanations
View on GitHub
Codes for "Benchmarking the Generation of Fact Checking Explanations"
☆10Aug 16, 2024Updated last year
CoderPat / learning-scaffold
View on GitHub
This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"
☆20May 19, 2022Updated 4 years ago
allenai / few_shot_explanations
View on GitHub
Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"
☆29Apr 28, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
successar / FRESH
View on GitHub
☆26Jun 12, 2023Updated 3 years ago
allenai / contrastive-explanations
View on GitHub
Explaining neural decisions contrastively to alternative decisions.
☆24Mar 18, 2021Updated 5 years ago
copenlu / xai-benchmark
View on GitHub
A Diagnostic Study of Explainability Techniques for Text Classification
☆70Oct 23, 2020Updated 5 years ago
visinf / fast-axiomatic-attribution
View on GitHub
Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)
☆15Feb 24, 2026Updated 5 months ago
mohsenfayyaz / DecompX
View on GitHub
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]
☆19Jul 3, 2025Updated last year
HanjieChen / REV
View on GitHub
Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"
☆16Aug 11, 2023Updated 2 years ago
mt-upc / transformer-contributions
View on GitHub
Measuring the Mixing of Contextual Information in the Transformer
☆35May 27, 2023Updated 3 years ago
INK-USC / expl-refinement
View on GitHub
Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)
☆11Oct 25, 2021Updated 4 years ago
keyonvafa / sequential-rationales
View on GitHub
Rationales for Sequential Predictions
☆39Mar 10, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xhan77 / influence-function-analysis
View on GitHub
☆64Apr 25, 2020Updated 6 years ago
yoavgur / PISCES
View on GitHub
🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
☆13Jun 28, 2026Updated 3 weeks ago
AndreasMadsen / nlp-roar-interpretability
View on GitHub
Measuring if attention is explanation with ROAR
☆22Mar 3, 2023Updated 3 years ago
Betswish / MIRAGE
View on GitHub
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆25Mar 10, 2025Updated last year
peterbhase / LAS-NL-Explanations
View on GitHub
Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"
☆21Oct 13, 2020Updated 5 years ago
yxuansu / Contrastive_Search_versus_Contrastive_Decoding
View on GitHub
An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation
☆27Jun 7, 2024Updated 2 years ago
ZurichNLP / ContraWSD
View on GitHub
Word sense disambiguation test sets for NMT
☆21Dec 3, 2020Updated 5 years ago
U-Sharma / NeuralScaleID
View on GitHub
☆11Oct 5, 2020Updated 5 years ago
DFKI-NLP / LLMCheckup
View on GitHub
Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…
☆13Mar 24, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
inseq-team / inseq
View on GitHub
Interpretability for sequence generation models 🐛 🔍
☆471Apr 25, 2026Updated 3 months ago
TransluceAI / circuits
View on GitHub
ADAG: Transluce's MLP neuron-level circuit tracing library
☆34Apr 10, 2026Updated 3 months ago
kayoyin / Prodigy
View on GitHub
CSE201 Objected-Oriented Programming in C++: Teach an AI to produce pieces of music
☆12Jan 23, 2019Updated 7 years ago
aadityasingh / icl-dynamics
View on GitHub
☆26Feb 20, 2026Updated 5 months ago
huashen218 / convxai
View on GitHub
CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing
☆15Jun 25, 2023Updated 3 years ago
UKPLab / acl2020-confidence-regularization
View on GitHub
☆24May 22, 2023Updated 3 years ago
google / belief-localization
View on GitHub
This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…
☆62May 9, 2023Updated 3 years ago
BierOne / Attention-Faithfulness
View on GitHub
[ICML 2022] This is the pytorch implementation of "Rethinking Attention-Model Explainability through Faithfulness Violation Test" (https:…
☆20Jul 21, 2022Updated 4 years ago
sibyl-dev / Explingo
View on GitHub
Explaining ML models using LLMs
☆25Oct 21, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NLP2CT / ua-cl-nmt
View on GitHub
Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)
☆11Jun 12, 2020Updated 6 years ago
zouharvi / subset2evaluate
View on GitHub
Find informative examples to efficiently (human)-evaluate NLG models.
☆17Apr 22, 2026Updated 3 months ago
thu-coai / CTRLEval
View on GitHub
Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022)
☆33Jun 6, 2022Updated 4 years ago
HCDM / XRec
View on GitHub
Models for explainable recommendation.
☆12Jan 19, 2024Updated 2 years ago
DiLi-Lab / ScanDL
View on GitHub
☆14Apr 29, 2025Updated last year
anthropics / headvis
View on GitHub
Head Vis Public Release
☆39May 4, 2026Updated 2 months ago
GChrysostomou / saloss
View on GitHub
☆11Dec 23, 2021Updated 4 years ago