Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Workshop (spotlight)
☆20May 29, 2024Updated 2 years ago
Alternatives and similar repositories for PURE
Users that are interested in PURE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Prototypical Concept-based Explanations, accepted at SAIAD workshop at CVPR 2024.☆16Feb 20, 2026Updated 3 months ago
- ☆17Jun 3, 2026Updated last week
- Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models. Paper presented at MICCAI 2023 conference.☆20Jan 17, 2024Updated 2 years ago
- A toolkit for quantitative evaluation of data attribution methods.☆60May 11, 2026Updated last month
- ☆15Nov 3, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024)☆34Nov 15, 2023Updated 2 years ago
- Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers, Paper accepted at eXCV workshop of ECCV 2…☆30Jan 6, 2025Updated last year
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- CoRelAy is a tool to compose small-scale (single-machine) analysis pipelines.☆32Apr 30, 2026Updated last month
- A Robot that classifies digits and shapes☆10Jul 10, 2019Updated 6 years ago
- An eXplainable AI toolkit with Concept Relevance Propagation and Relevance Maximization☆144Jan 14, 2026Updated 4 months ago
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).☆366Jul 23, 2025Updated 10 months ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- A tiny easily hackable implementation of a feature dashboard.☆16Oct 21, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP.☆246May 13, 2026Updated 3 weeks ago
- ☆14Jun 12, 2023Updated 3 years ago
- [ICML 24] A novel automated neuron explanation framework that can accurately describe poly-semantic concepts in deep neural networks☆14May 2, 2025Updated last year
- Official implemention of the paper High-Resolution and Precise Counterfactual Medical Image Generation using Language-guided Stable Diffu…☆23Jul 8, 2025Updated 11 months ago
- Layer-wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]☆237Jul 11, 2025Updated 11 months ago
- Mapping out the "memory" of neural nets with data attribution☆58Jun 5, 2026Updated last week
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆30Jan 26, 2025Updated last year
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Jul 12, 2024Updated last year
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Concept Relevance Propagation for Localization Models, accepted at SAIAD workshop at CVPR 2023.☆15Jan 16, 2024Updated 2 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La…☆29Nov 3, 2025Updated 7 months ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆96May 25, 2023Updated 3 years ago
- Official code for "Good Teachers Explain: Explanation-enhanced Knowledge Distillation". ECCV 2024☆20Oct 30, 2024Updated last year
- Explainable AI in Julia.☆118Updated this week
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Apr 14, 2026Updated last month
- A simple and easily modifiable and expandable text adventure game engine☆12Jun 9, 2020Updated 6 years ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the ICLR 2022 paper. Salient Imagenet: How to discover spurious features in deep learning?☆41Aug 19, 2022Updated 3 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆14Feb 13, 2023Updated 3 years ago
- Code accompanying "Dynamic Predictive Coding: A Model of Hierarchical Sequence Learning and Prediction in the Neocortex"☆10Mar 2, 2025Updated last year
- Understanding Rare Spurious Correlations in Neural Network☆12Jun 5, 2022Updated 4 years ago
- ☆22Oct 18, 2024Updated last year
- ☆54Oct 23, 2023Updated 2 years ago
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"☆47May 31, 2024Updated 2 years ago