chihkuanyeh / concept_exp
code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks"
☆53Updated 2 years ago
Alternatives and similar repositories for concept_exp:
Users that are interested in concept_exp are comparing it to the libraries listed below
- reference implementation for "explanations can be manipulated and geometry is to blame"☆36Updated 2 years ago
- PyTorch Transformer-based Language Model Implementation of ConceptSHAP☆12Updated 4 years ago
- ☆73Updated 4 years ago
- This repository provides a PyTorch implementation of "Fooling Neural Network Interpretations via Adversarial Model Manipulation". Our pap…☆22Updated 4 years ago
- Invertible Concept-based Explanation (ICE)☆18Updated 3 years ago
- Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity for Explanations" in NeurIPS 2019 for…☆25Updated 2 years ago
- Explaining Image Classifiers by Counterfactual Generation☆28Updated 2 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆70Updated 9 months ago
- ☆51Updated 4 years ago
- ☆109Updated 2 years ago
- Concept Bottleneck Models, ICML 2020☆188Updated last year
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆50Updated 3 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆41Updated last year
- ☆54Updated 4 years ago
- ☆46Updated 4 years ago
- Original dataset release for CIFAR-10H☆82Updated 4 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 2 years ago
- Pytorch library for model calibration metrics and visualizations as well as recalibration methods. In progress!☆70Updated this week
- Learning from Failure: Training Debiased Classifier from Biased Classifier (NeurIPS 2020)☆90Updated 4 years ago
- A simple PyTorch implementation of influence functions.☆84Updated 8 months ago
- The Pitfalls of Simplicity Bias in Neural Networks [NeurIPS 2020] (http://arxiv.org/abs/2006.07710v2)☆39Updated last year
- CME: Concept-based Model Extraction☆12Updated 4 years ago
- Code for the ICLR 2022 paper. Salient Imagenet: How to discover spurious features in deep learning?☆38Updated 2 years ago
- Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)☆128Updated 3 years ago
- ☆44Updated 2 years ago
- Quantitative Testing with Concept Activation Vectors in PyTorch☆42Updated 5 years ago
- Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" ht…☆127Updated 3 years ago
- ☆50Updated last year
- Codes for reproducing the experimental results in "Proper Network Interpretability Helps Adversarial Robustness in Classification", publi…☆13Updated 4 years ago
- [ICLR'22] Self-supervised learning optimally robust representations for domain shift.☆23Updated 3 years ago