rachtibat / LRP-eXplains-Transformers
Layer-Wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]
☆156 Updated last month
Alternatives and similar repositories for LRP-eXplains-Transformers
Users who are interested in LRP-eXplains-Transformers compare it to the libraries listed below.
- CoSy: Evaluating Textual Explanations ☆16 Updated 3 months ago
- An eXplainable AI toolkit with Concept Relevance Propagation and Relevance Maximization ☆126 Updated 11 months ago
- Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP. ☆225 Updated 9 months ago
- A toolkit for quantitative evaluation of data attribution methods. ☆45 Updated 3 weeks ago
- MetaQuantus is an XAI performance tool to identify reliable evaluation metrics ☆34 Updated last year
- ☆12 Updated this week
- ☆93 Updated last month
- Using sparse coding to find distributed representations used by neural networks. ☆242 Updated last year
- A basic implementation of Layer-wise Relevance Propagation (LRP) in PyTorch. ☆92 Updated 2 years ago
- A fast, effective data attribution method for neural networks in PyTorch ☆209 Updated 5 months ago
- Official Code Implementation of the paper: XAI for Transformers: Better Explanations through Conservative Propagation ☆63 Updated 3 years ago
- Sparse Autoencoder for Mechanistic Interpretability ☆246 Updated 9 months ago
- Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models. Paper presented at MICCAI 2023 conference. ☆19 Updated last year
- ☆286 Updated 3 months ago
- ☆223 Updated 7 months ago
- 👋 Overcomplete is a Vision-based SAE Toolbox ☆53 Updated last month
- Conformal Language Modeling ☆29 Updated last year
- Interpretability for sequence generation models 🐛 🔍 ☆413 Updated 2 weeks ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆64 Updated 7 months ago
- Steering Llama 2 with Contrastive Activation Addition ☆148 Updated 11 months ago
- Materials for EACL2024 tutorial: Transformer-specific Interpretability ☆52 Updated last year
- Code for paper: Are Large Language Models Post Hoc Explainers? ☆31 Updated 9 months ago
- OpenXAI: Towards a Transparent Evaluation of Model Explanations ☆246 Updated 8 months ago
- AI Logging for Interpretability and Explainability 🔬 ☆116 Updated 11 months ago
- This repository collects all relevant resources about interpretability in LLMs ☆343 Updated 6 months ago
- ☆167 Updated last month
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Works… ☆14 Updated 11 months ago
- ☆92 Updated 3 months ago
- Concept Relevance Propagation for Localization Models, accepted at SAIAD workshop at CVPR 2023. ☆14 Updated last year
- ☆132 Updated last year