rachtibat / LRP-eXplains-Transformers
Layer-Wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]
☆118 Updated last month
Alternatives and similar repositories for LRP-eXplains-Transformers:
Users who are interested in LRP-eXplains-Transformers are comparing it to the libraries listed below.
- An eXplainable AI toolkit with Concept Relevance Propagation and Relevance Maximization ☆122 Updated 7 months ago
- A basic implementation of Layer-wise Relevance Propagation (LRP) in PyTorch. ☆86 Updated 2 years ago
- MetaQuantus is an XAI performance tool to identify reliable evaluation metrics ☆33 Updated 9 months ago
- CoSy: Evaluating Textual Explanations ☆13 Updated last week
- A toolkit for quantitative evaluation of data attribution methods. ☆39 Updated this week
- Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP. ☆209 Updated 6 months ago
- Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models. Paper presented at MICCAI 2023 conference. ☆19 Updated last year
- Official code implementation of the paper "XAI for Transformers: Better Explanations through Conservative Propagation" ☆63 Updated 2 years ago
- A repository for summaries of recent explainable AI/Interpretable ML approaches ☆71 Updated 3 months ago
- Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024. ☆34 Updated 2 months ago
- A PyTorch 1.6 implementation of Layer-Wise Relevance Propagation (LRP). ☆132 Updated 3 years ago
- 👋 Code for: "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023) ☆61 Updated last year
- Explain Neural Networks using Layer-Wise Relevance Propagation and evaluate the explanations using Pixel-Flipping and Area Under the Curv… ☆15 Updated 2 years ago
- Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers, Paper accepted at eXCV workshop of ECCV 2… ☆18 Updated 3 weeks ago
- OpenXAI: Towards a Transparent Evaluation of Model Explanations ☆237 Updated 5 months ago
- Code for the paper "Post-hoc Concept Bottleneck Models". Spotlight @ ICLR 2023 ☆73 Updated 8 months ago
- Concept Relevance Propagation for Localization Models, accepted at SAIAD workshop at CVPR 2023. ☆13 Updated last year
- A new framework to transform any neural network into an interpretable concept-bottleneck-model (CBM) without needing labeled concept dat… ☆85 Updated 9 months ago
- Using sparse coding to find distributed representations used by neural networks. ☆210 Updated last year
- Sparse Autoencoder for Mechanistic Interpretability ☆214 Updated 6 months ago
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Works… ☆12 Updated 8 months ago
- A simple PyTorch implementation of influence functions. ☆84 Updated 7 months ago
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition. ☆82 Updated 8 months ago
- A fast, effective data attribution method for neural networks in PyTorch ☆189 Updated 2 months ago
- A resource repository for representation engineering in large language models ☆98 Updated 2 months ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆61 Updated 3 months ago
- Concept Bottleneck Models, ICML 2020 ☆185 Updated last year
- Full code for the sparse probing paper. ☆54 Updated last year