iPieter / ethical-adversariesLinks
⚖️ Code for the paper "Ethical Adversaries: Towards Mitigating Unfairness with Adversarial Machine Learning".
☆11Updated 3 years ago
Alternatives and similar repositories for ethical-adversaries
Users that are interested in ethical-adversaries are comparing it to the libraries listed below
Sorting:
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆72Updated 2 years ago
- Implementation of Adversarial Debiasing in PyTorch to address Gender Bias☆31Updated 5 years ago
- General fair regression subject to demographic parity constraint. Paper appeared in ICML 2019.☆16Updated 5 years ago
- ☆109Updated 2 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 2 years ago
- code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks"☆54Updated 3 years ago
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆25Updated 3 years ago
- Code for Active Learning at The ImageNet Scale. This repository implements many popular active learning algorithms and allows training wi…☆54Updated 4 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated 2 years ago
- This repository provides a PyTorch implementation of "Fooling Neural Network Interpretations via Adversarial Model Manipulation". Our pap…☆23Updated 4 years ago
- PyTorch code for the Neurips 2021 paper: Fairness via Representation Neutralization☆10Updated 4 years ago
- A simple PyTorch implementation of influence functions.☆91Updated last year
- ☆31Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆58Updated 2 years ago
- ☆14Updated 5 years ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆42Updated 3 years ago
- ☆38Updated 4 years ago
- ☆17Updated 5 years ago
- ☆37Updated 2 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆17Updated last year
- Official repo for the paper "Make Some Noise: Reliable and Efficient Single-Step Adversarial Training" (https://arxiv.org/abs/2202.01181)☆25Updated 3 years ago
- ☆32Updated 4 years ago
- A benchmark of data-centric tasks from across the machine learning lifecycle.☆72Updated 3 years ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆83Updated last year
- Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity for Explanations" in NeurIPS 2019 for…☆25Updated 3 years ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022]☆30Updated 3 years ago
- Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)☆21Updated 3 years ago
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆36Updated 3 years ago
- Advances in Neural Information Processing Systems (NeurIPS 2021)☆22Updated 3 years ago
- ☆21Updated 4 years ago