hbaniecki / adversarial-explainable-ai
π‘ Adversarial attacks on explanations and how to defend them
β312Updated 3 months ago
Alternatives and similar repositories for adversarial-explainable-ai:
Users that are interested in adversarial-explainable-ai are comparing it to the libraries listed below
- Adversarial Attacks on Post Hoc Explanation Techniques (LIME/SHAP)β82Updated 2 years ago
- OpenXAI : Towards a Transparent Evaluation of Model Explanationsβ240Updated 7 months ago
- A curated list of papers on adversarial machine learning (adversarial examples and defense methods).β210Updated 2 years ago
- Interesting resources related to Explainable Artificial Intelligence, Interpretable Machine Learning, Interactive Machine Learning, Humanβ¦β73Updated 2 years ago
- reference implementation for "explanations can be manipulated and geometry is to blame"β36Updated 2 years ago
- Library containing PyTorch implementations of various adversarial attacks and resourcesβ151Updated 3 weeks ago
- RobustBench: a standardized adversarial robustness benchmark [NeurIPS 2021 Benchmarks and Datasets Track]β702Updated last month
- A library for experimenting with, training and evaluating neural networks, with a focus on adversarial robustness.β930Updated last year
- Related papers for robust machine learningβ568Updated last year
- Code for "On Adaptive Attacks to Adversarial Example Defenses"β86Updated 4 years ago
- All about explainable AI, algorithmic fairness and moreβ107Updated last year
- β142Updated 5 months ago
- A Python library for Secure and Explainable Machine Learningβ172Updated last month
- A unified benchmark problem for data poisoning attacksβ153Updated last year
- β122Updated 3 years ago
- This repository provides simple PyTorch implementations for adversarial training methods on CIFAR-10.β162Updated 4 years ago
- Quantus is an eXplainable AI toolkit for responsible evaluation of neural network explanationsβ594Updated last month
- pyDVL is a library of stable implementations of algorithms for data valuation and influence function computationβ116Updated this week
- A curated list of awesome Fairness in AI resourcesβ317Updated last year
- Datasets derived from US census dataβ255Updated 10 months ago
- A curated list of trustworthy deep learning papers. Daily updating...β359Updated last week
- Code relative to "Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks"β683Updated 10 months ago
- List of relevant resources for machine learning from explanatory supervisionβ156Updated 2 months ago
- A repository to quickly generate synthetic data and associated trojaned deep learning modelsβ77Updated last year
- Provable adversarial robustness at ImageNet scaleβ383Updated 5 years ago
- π Influenciae is a Tensorflow Toolbox for Influence Functionsβ61Updated 11 months ago
- π A curated list of awesome real-world adversarial examples resourcesβ58Updated 4 years ago
- Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" htβ¦β127Updated 4 years ago
- Papers and code of Explainable AI esp. w.r.t. Image classificiationβ204Updated 2 years ago
- CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithmsβ286Updated last year