visinf / fast-axiomatic-attribution
Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)
☆16 · May 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for fast-axiomatic-attribution
Users interested in fast-axiomatic-attribution are comparing it to the libraries listed below.
- ☆13 · Jul 26, 2023 · Updated 2 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. ☆18 · Apr 25, 2021 · Updated 4 years ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100 ☆18 · Feb 20, 2022 · Updated 3 years ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models ☆12 · May 30, 2025 · Updated 8 months ago
- ☆38 · Oct 3, 2023 · Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021) ☆12 · Oct 25, 2021 · Updated 4 years ago
- PyTorch-based library for various kinds of representational-similarity analysis ☆24 · Jun 7, 2024 · Updated last year
- Provable Worst Case Guarantees for the Detection of Out-of-Distribution Data ☆13 · Sep 20, 2022 · Updated 3 years ago
- ☆12 · Sep 26, 2019 · Updated 6 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer" ☆12 · Aug 17, 2021 · Updated 4 years ago
- ☆17 · Aug 30, 2025 · Updated 5 months ago
- Coupling rejection strategy against adversarial attacks (CVPR 2022) ☆29 · Mar 2, 2022 · Updated 3 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations" ☆31 · Apr 28, 2023 · Updated 2 years ago
- Code for CVPR2021 paper: MOOD: Multi-level Out-of-distribution Detection ☆38 · Sep 4, 2023 · Updated 2 years ago
- ☆19 · Sep 16, 2025 · Updated 4 months ago
- Find informative examples to efficiently (human)-evaluate NLG models. ☆18 · Feb 6, 2026 · Updated last week
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper) ☆15 · Mar 24, 2023 · Updated 2 years ago
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La… ☆25 · Nov 3, 2025 · Updated 3 months ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales" ☆16 · Aug 11, 2023 · Updated 2 years ago
- code released for our TIP 2021 paper "Adversarial Domain Adaptation with Prototype-based Normalized Output Conditioner" ☆15 · May 24, 2023 · Updated 2 years ago
- Codebase for the paper "Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning" ☆17 · Jul 12, 2021 · Updated 4 years ago
- Deep Learning Research ☆16 · Nov 13, 2019 · Updated 6 years ago
- Attribute statements generated by LLMs to preceding tokens using attention weights. ☆21 · Apr 22, 2025 · Updated 9 months ago
- ☆18 · Oct 6, 2022 · Updated 3 years ago
- ☆12 · Sep 11, 2022 · Updated 3 years ago
- Saliency Cards are transparency documentation for saliency methods. Learn about new saliency methods or document your own! ☆18 · Jun 9, 2023 · Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆39 · Dec 27, 2022 · Updated 3 years ago
- Rationales for Sequential Predictions ☆40 · Mar 10, 2022 · Updated 3 years ago
- Official Code Repo for the Paper: "How does This Interaction Affect Me? Interpretable Attribution for Feature Interactions", In NeurIPS 2… ☆42 · Oct 31, 2022 · Updated 3 years ago
- 👋 Overcomplete is a Vision-based SAE Toolbox ☆119 · Dec 4, 2025 · Updated 2 months ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching" ☆19 · May 19, 2022 · Updated 3 years ago
- Delta Orthogonal Initialization for PyTorch ☆18 · Jun 27, 2018 · Updated 7 years ago
- ☆19 · Jan 27, 2021 · Updated 5 years ago
- Implementation of CVPR 2022 paper "Learning Distinctive Margin toward Active Domain Adaptation" ☆21 · Apr 3, 2022 · Updated 3 years ago
- Neural-Backed Decision Tree sample integration with pytorch-image-models ☆16 · Sep 18, 2020 · Updated 5 years ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers ☆21 · May 16, 2023 · Updated 2 years ago
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?" ☆22 · Oct 13, 2020 · Updated 5 years ago
- ☆24 · Jun 22, 2022 · Updated 3 years ago
- ☆26 · Nov 8, 2022 · Updated 3 years ago