dmitrykazhdan / CME
CME: Concept-based Model Extraction
☆12 · Updated 4 years ago
Alternatives and similar repositories for CME:
Users interested in CME are comparing it to the libraries listed below.
- DISSECT: Disentangled Simultaneous Explanations via Concept Traversals ☆11 · Updated last year
- This repository contains the implementation of Concept Activation Regions, a new framework to explain deep neural networks with human con… ☆11 · Updated 2 years ago
- Code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks" ☆53 · Updated 3 years ago
- ☆38 · Updated 3 years ago
- Library implementing state-of-the-art Concept-based and Disentanglement Learning methods for Explainable AI ☆54 · Updated 2 years ago
- ☆16 · Updated 2 years ago
- Self-Explaining Neural Networks ☆13 · Updated last year
- CVPR'19 experiments with (on-manifold) adversarial examples. ☆44 · Updated 5 years ago
- Understanding Rare Spurious Correlations in Neural Networks ☆12 · Updated 2 years ago
- Implementation of the paper "A Framework for Learning Ante-hoc Explainable Models via Concepts" (CVPR 2022). ☆8 · Updated 9 months ago
- Official implementation of "Robust Semantic Interpretability: Revisiting Concept Activation Vectors" ☆11 · Updated 4 years ago
- Code for the ICLR 2022 paper "Attention-based interpretability with Concept Transformers" ☆40 · Updated 2 years ago
- Code for "Generative causal explanations of black-box classifiers" ☆34 · Updated 4 years ago
- Code for Environment Inference for Invariant Learning (ICML 2021 paper) ☆50 · Updated 3 years ago
- Overlooked Factors in Concept-based Explanations: Dataset Choice, Concept Learnability, and Human Capability (CVPR 2023) ☆9 · Updated 2 years ago
- ☆45 · Updated 2 years ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022] ☆30 · Updated 2 years ago
- Code for "Interpretable Image Recognition with Hierarchical Prototypes" ☆18 · Updated 5 years ago
- This repository provides a PyTorch implementation of "Fooling Neural Network Interpretations via Adversarial Model Manipulation". Our pap… ☆22 · Updated 4 years ago
- Code for reproducing the experimental results in "Proper Network Interpretability Helps Adversarial Robustness in Classification", publi… ☆13 · Updated 4 years ago
- Code for the paper "Getting a CLUE: A Method for Explaining Uncertainty Estimates" ☆35 · Updated 11 months ago
- [ICLR'22] Self-supervised learning of optimally robust representations for domain shift. ☆23 · Updated 3 years ago
- Reference implementation for "explanations can be manipulated and geometry is to blame" ☆36 · Updated 2 years ago
- Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations is a ServiceNow Research project that was started at Elemen… ☆13 · Updated last year
- Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity for Explanations" in NeurIPS 2019 for… ☆25 · Updated 3 years ago
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS 2021). ☆23 · Updated 2 years ago
- Explaining Image Classifiers by Counterfactual Generation ☆28 · Updated 3 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781) ☆13 · Updated 2 years ago
- PyTorch code for KDD 18 paper: Towards Explanation of DNN-based Prediction with Guided Feature Inversion ☆21 · Updated 6 years ago
- On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them [NeurIPS 2020] ☆36 · Updated 3 years ago