chihkuanyeh / concept_expLinks

code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks"

☆53

Alternatives and similar repositories for concept_exp

Users that are interested in concept_exp are comparing it to the libraries listed below

Sorting:

pankessel / adv_explanation_ref
reference implementation for "explanations can be manipulated and geometry is to blame"
☆36Updated 3 years ago
yewsiang / ConceptBottleneck
Concept Bottleneck Models, ICML 2020
☆208Updated 2 years ago
princetonvisualai / DomainBiasMitigation
☆74Updated 5 years ago
chihkuanyeh / saliency_evaluation
Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity for Explanations" in NeurIPS 2019 for…
☆25Updated 3 years ago
amiratag / ACE
Towards Automatic Concept-based Explanations
☆160Updated last year
anniesch / jtt
Code for "Just Train Twice: Improving Group Robustness without Training Group Information"
☆72Updated last year
jcpeterson / cifar-10h
Original dataset release for CIFAR-10H
☆83Updated 4 years ago
zhangrh93 / InvertibleCE
Invertible Concept-based Explanation (ICE)
☆18Updated 4 years ago
zzzace2000 / FIDO-saliency
Explaining Image Classifiers by Counterfactual Generation
☆28Updated 3 years ago
arnav-gudibande / conceptSHAP
PyTorch Transformer-based Language Model Implementation of ConceptSHAP
☆14Updated 5 years ago
alinlab / LfF
Learning from Failure: Training Debiased Classifier from Biased Classifier (NeurIPS 2020)
☆91Updated 4 years ago
adebayoj / sanity_checks_saliency
☆112Updated 2 years ago
laura-rieger / deep-explanation-penalization
Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" ht…
☆128Updated 4 years ago
dmitrykazhdan / CME
CME: Concept-based Model Extraction
☆11Updated 4 years ago
izmailovpavel / spurious_feature_learning
☆46Updated 2 years ago
rmrisforbidden / Fooling_Neural_Network-Interpretations
This repository provides a PyTorch implementation of "Fooling Neural Network Interpretations via Adversarial Model Manipulation". Our pap…
☆22Updated 4 years ago
MadryLab / BREEDS-Benchmarks
☆55Updated 4 years ago
harshays / inputgradients
Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)
☆13Updated 2 years ago
singlasahil14 / barlow
Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction
☆36Updated 3 years ago
google-research-datasets / bam
☆51Updated 4 years ago
MadryLab / DebuggableDeepNetworks
☆38Updated 4 years ago
ecreager / eiil
Code for Environment Inference for Invariant Learning (ICML 2021 Paper)
☆50Updated 4 years ago
rgeirhos / shortcut-perspective
Figures & code from the paper "Shortcut Learning in Deep Neural Networks" (Nature Machine Intelligence 2020)
☆99Updated 3 years ago
microsoft / robustdg
Toolkit for building machine learning models that generalize to unseen domains and are robust to privacy and other attacks.
☆176Updated last year
SinaMohseni / Awesome-XAI-Evaluation
Reference tables to introduce and organize evaluation methods and measures for explainable machine learning systems
☆74Updated 3 years ago
HazyResearch / hidden-stratification
Combating hidden stratification with GEORGE
☆64Updated 4 years ago
singlasahil14 / salient_imagenet
Code for the ICLR 2022 paper. Salient Imagenet: How to discover spurious features in deep learning?
☆40Updated 2 years ago
PolinaKirichenko / deep_feature_reweighting
☆108Updated last year
kohpangwei / group-influence-release
☆50Updated 2 years ago
choprashweta / Adversarial-Debiasing
Implementation of Adversarial Debiasing in PyTorch to address Gender Bias
☆31Updated 5 years ago