choprashweta / Adversarial-Debiasing
Implementation of Adversarial Debiasing in PyTorch to address Gender Bias
☆31Updated 4 years ago
Alternatives and similar repositories for Adversarial-Debiasing:
Users that are interested in Adversarial-Debiasing are comparing it to the libraries listed below
- General fair regression subject to demographic parity constraint. Paper appeared in ICML 2019.☆15Updated 4 years ago
- This is a collection of papers and other resources related to fairness.☆92Updated last year
- ☆37Updated 2 years ago
- Reference tables to introduce and organize evaluation methods and measures for explainable machine learning systems☆74Updated 2 years ago
- code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks"☆53Updated 2 years ago
- ☆13Updated 4 years ago
- ☆22Updated 6 years ago
- Code for "Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior?"☆46Updated last year
- This is a benchmark to evaluate machine learning local explanaitons quality generated from any explainer for text and image data☆30Updated 3 years ago
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆50Updated 3 years ago
- Adversarial Attacks on Post Hoc Explanation Techniques (LIME/SHAP)☆82Updated 2 years ago
- ☆14Updated 5 years ago
- ⚖️ Code for the paper "Ethical Adversaries: Towards Mitigating Unfairness with Adversarial Machine Learning".☆11Updated 2 years ago
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆35Updated 2 years ago
- Official repository for CMU Machine Learning Department's 10732: Robustness and Adaptivity in Shifting Environments☆73Updated 2 years ago
- ☆38Updated 3 years ago
- CVPR'19 experiments with (on-manifold) adversarial examples.☆44Updated 5 years ago
- Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity for Explanations" in NeurIPS 2019 for…☆25Updated 3 years ago
- Implementation of Minimax Pareto Fairness framework☆21Updated 4 years ago
- Library implementing state-of-the-art Concept-based and Disentanglement Learning methods for Explainable AI☆53Updated 2 years ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022]☆30Updated 2 years ago
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆18Updated 2 years ago
- A pytorch implemention of the Explainable AI work 'Contrastive layerwise relevance propagation (CLRP)'☆17Updated 2 years ago
- Code and data for the experiments in "On Fairness and Calibration"☆50Updated 2 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 2 years ago
- Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"☆17Updated 2 years ago
- Hands-on tutorial on ML Fairness☆70Updated last year
- NumPy library for calibration metrics☆69Updated 3 weeks ago
- Code to reproduce our paper on probabilistic algorithmic recourse: https://arxiv.org/abs/2006.06831☆36Updated 2 years ago
- ☆87Updated last year