jim-berend / semanticlensLinks
Mechanistic understanding and validation of large AI models with SemanticLens
☆48Updated last month
Alternatives and similar repositories for semanticlens
Users that are interested in semanticlens are comparing it to the libraries listed below
Sorting:
- Layer-wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]☆213Updated 5 months ago
- An eXplainable AI toolkit with Concept Relevance Propagation and Relevance Maximization☆140Updated last year
- 👋 Overcomplete is a Vision-based SAE Toolbox☆112Updated last month
- A toolkit for quantitative evaluation of data attribution methods.☆54Updated 5 months ago
- MetaQuantus is an XAI performance tool to identify reliable evaluation metrics☆40Updated last year
- Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP.☆239Updated 5 months ago
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La…☆23Updated 2 months ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" 🐍☆45Updated last year
- LENS Project☆51Updated last year
- [NeurIPS 2024] CoSy is an automatic evaluation framework for textual explanations of neurons.☆19Updated 6 months ago
- Dataset and code for the CLEVR-XAI dataset.☆33Updated 2 years ago
- Codebase for information theoretic shapley values to explain predictive uncertainty.This repo contains the code related to the paperWatso…☆22Updated last year
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Works…☆19Updated last year
- OpenXAI : Towards a Transparent Evaluation of Model Explanations☆252Updated last year
- ☆32Updated last year
- ☆57Updated 11 months ago
- [NeurIPS 2024] Code for the paper: B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable.☆38Updated 2 months ago
- PyTorch Explain: Interpretable Deep Learning in Python.☆166Updated last year
- Repository for our NeurIPS 2022 paper "Concept Embedding Models", our NeurIPS 2023 paper "Learning to Receive Help", and our ICML 2025 pa…☆72Updated 2 months ago
- XAI-Bench is a library for benchmarking feature attribution explainability techniques☆70Updated 2 years ago
- Conformal Language Modeling☆32Updated 2 years ago
- Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.☆53Updated last year
- ☆26Updated 3 weeks ago
- ☆16Updated 8 months ago
- Library implementing state-of-the-art Concept-based and Disentanglement Learning methods for Explainable AI☆55Updated 3 years ago
- 👋 Code for : "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023)☆71Updated 2 years ago
- Explain Neural Networks using Layer-Wise Relevance Propagation and evaluate the explanations using Pixel-Flipping and Area Under the Curv…☆16Updated 3 years ago
- Library that provides metrics to assess representation quality☆20Updated 11 months ago
- 🪄 Interpreto is an interpretability toolbox for LLMs☆95Updated 2 weeks ago
- Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models. Paper presented at MICCAI 2023 conference.☆20Updated last year