choprashweta / Adversarial-Debiasing
Implementation of Adversarial Debiasing in PyTorch to address Gender Bias
☆30 Updated 4 years ago
Related projects
Alternatives and complementary repositories for Adversarial-Debiasing
- General fair regression subject to demographic parity constraint. Paper appeared in ICML 2019. ☆14 Updated 4 years ago
- ☆35 Updated last year
- ☆13 Updated 3 years ago
- code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks" ☆51 Updated 2 years ago
- Reference tables to introduce and organize evaluation methods and measures for explainable machine learning systems ☆73 Updated 2 years ago
- ☆15 Updated 4 years ago
- This is a collection of papers and other resources related to fairness. ☆92 Updated last year
- ☆22 Updated 5 years ago
- Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity for Explanations" in NeurIPS 2019 for… ☆24 Updated 2 years ago
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper) ☆49 Updated 3 years ago
- Distributional Shapley: A Distributional Framework for Data Valuation ☆30 Updated 6 months ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022] ☆30 Updated 2 years ago
- Implementation of Minimax Pareto Fairness framework ☆21 Updated 4 years ago
- PyTorch code for the NeurIPS 2021 paper: Fairness via Representation Neutralization ☆9 Updated 3 years ago
- PyTorch library for model calibration metrics and visualizations as well as recalibration methods. In progress! ☆68 Updated 6 months ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy ☆61 Updated 8 months ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral) ☆38 Updated 2 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information" ☆66 Updated 5 months ago
- A reproduced PyTorch implementation of the Adversarially Reweighted Learning (ARL) model, originally presented in "Fairness without Demog… ☆20 Updated 3 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781) ☆13 Updated last year
- ☆72 Updated 4 years ago
- Code to reproduce our paper on probabilistic algorithmic recourse: https://arxiv.org/abs/2006.06831 ☆34 Updated last year
- ⚖️ Code for the paper "Ethical Adversaries: Towards Mitigating Unfairness with Adversarial Machine Learning". ☆11 Updated last year
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22) ☆19 Updated 2 years ago
- Adversarial Attacks on Post Hoc Explanation Techniques (LIME/SHAP) ☆80 Updated last year
- ☆48 Updated last year
- A simple PyTorch implementation of influence functions. ☆79 Updated 4 months ago
- A benchmark for evaluating the quality of local machine learning explanations generated by any explainer for text and image data ☆30 Updated 3 years ago
- Learning from Failure: Training Debiased Classifier from Biased Classifier (NeurIPS 2020) ☆89 Updated 4 years ago
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness" ☆12 Updated 3 years ago