rachtibat / LRP-eXplains-TransformersLinks
Layer-Wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]
☆164Updated 2 months ago
Alternatives and similar repositories for LRP-eXplains-Transformers
Users that are interested in LRP-eXplains-Transformers are comparing it to the libraries listed below
Sorting:
- An eXplainable AI toolkit with Concept Relevance Propagation and Relevance Maximization☆129Updated 11 months ago
- Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP.☆226Updated 10 months ago
- A toolkit for quantitative evaluation of data attribution methods.☆47Updated last month
- CoSy: Evaluating Textual Explanations☆16Updated 4 months ago
- ☆13Updated 3 weeks ago
- MetaQuantus is an XAI performance tool to identify reliable evaluation metrics☆34Updated last year
- Official Code Implementation of the paper : XAI for Transformers: Better Explanations through Conservative Propagation☆63Updated 3 years ago
- Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models. Paper presented at MICCAI 2023 conference.☆19Updated last year
- A basic implementation of Layer-wise Relevance Propagation (LRP) in PyTorch.☆94Updated 2 years ago
- 👋 Overcomplete is a Vision-based SAE Toolbox☆57Updated 2 months ago
- ☆97Updated last month
- Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers, Paper accepted at eXCV workshop of ECCV 2…☆24Updated 4 months ago
- OpenXAI : Towards a Transparent Evaluation of Model Explanations☆247Updated 9 months ago
- A PyTorch 1.6 implementation of Layer-Wise Relevance Propagation (LRP).☆136Updated 4 years ago
- Concept Relevance Propagation for Localization Models, accepted at SAIAD workshop at CVPR 2023.☆14Updated last year
- Materials for EACL2024 tutorial: Transformer-specific Interpretability☆54Updated last year
- Explain Neural Networks using Layer-Wise Relevance Propagation and evaluate the explanations using Pixel-Flipping and Area Under the Curv…☆16Updated 2 years ago
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Works…☆14Updated last year
- Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.☆44Updated 7 months ago
- A resource repository for representation engineering in large language models☆124Updated 6 months ago
- LENS Project☆48Updated last year
- ☆36Updated 2 months ago
- Using sparse coding to find distributed representations used by neural networks.☆247Updated last year
- ☆70Updated 2 years ago
- Conformal Language Modeling☆29Updated last year
- Code for paper: Are Large Language Models Post Hoc Explainers?☆31Updated 10 months ago
- Sparse Autoencoder for Mechanistic Interpretability☆248Updated 10 months ago
- ☆11Updated last month
- CoRelAy is a tool to compose small-scale (single-machine) analysis pipelines.☆28Updated this week
- This repository collects all relevant resources about interpretability in LLMs☆353Updated 7 months ago