iPieter / ethical-adversaries

⚖️ Code for the paper "Ethical Adversaries: Towards Mitigating Unfairness with Adversarial Machine Learning".

☆11

Alternatives and similar repositories for ethical-adversaries

Users who are interested in ethical-adversaries compare it to the libraries listed below.

chihkuanyeh / concept_exp
code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks"
☆53 Updated 3 years ago
rmrisforbidden / Fooling_Neural_Network-Interpretations
This repository provides a PyTorch implementation of "Fooling Neural Network Interpretations via Adversarial Model Manipulation". Our pap…
☆22 Updated 4 years ago
amiratag / DistributionalShapley
Distributional Shapley: A Distributional Framework for Data Valuation
☆30 Updated last year
tml-epfl / adv-training-corruptions
On the effectiveness of adversarial training against common corruptions [UAI 2022]
☆30 Updated 3 years ago
steven7woo / fair_regression_reduction
General fair regression subject to demographic parity constraint. Paper appeared in ICML 2019.
☆16 Updated 4 years ago
pdejorge / N-FGSM
Official repo for the paper "Make Some Noise: Reliable and Efficient Single-Step Adversarial Training" (https://arxiv.org/abs/2202.01181)
☆25 Updated 2 years ago
zleizzo / datadeletion
☆14 Updated 5 years ago
singlasahil14 / barlow
Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction
☆36 Updated 3 years ago
pratyushmaini / localizing-memorization
Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"
☆18 Updated last year
facebookresearch / fisher_information_loss
This code reproduces the results of the paper, "Measuring Data Leakage in Machine-Learning Models with Fisher Information"
☆50 Updated 3 years ago
dedeswim / vits-robustness-torch
Code for the paper "A Light Recipe to Train Robust Vision Transformers" [SaTML 2023]
☆52 Updated 2 years ago
DequanWang / dent
Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks
☆39 Updated 4 years ago
cassidylaidlaw / perceptual-advex
Code and data for the ICLR 2021 paper "Perceptual Adversarial Robustness: Defense Against Unseen Threat Models".
☆55 Updated 3 years ago
BorealisAI / mma_training
Code for the paper "MMA Training: Direct Input Space Margin Maximization through Adversarial Training"
☆34 Updated 5 years ago
jh-jeong / smoothmix
Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)
☆21 Updated 2 years ago
izmailovpavel / spurious_feature_learning
☆45 Updated 2 years ago
LTS4 / hold-me-tight
Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"
☆22 Updated 3 years ago
zeyademam / active_learning
Code for Active Learning at The ImageNet Scale. This repository implements many popular active learning algorithms and allows training wi…
☆53 Updated 3 years ago
ykwon0407 / beta_shapley
Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)
☆41 Updated 2 years ago
kohpangwei / group-influence-release
☆50 Updated 2 years ago
litian96 / TERM
Tilted Empirical Risk Minimization (ICLR '21)
☆59 Updated last year
chihkuanyeh / saliency_evaluation
Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity for Explanations" in NeurIPS 2019 for…
☆25 Updated 3 years ago
harshays / inputgradients
Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)
☆13 Updated 2 years ago
sayakpaul / robustness-foundation-models
This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.
☆70 Updated 2 years ago
kyleliang919 / Uncovering-the-Connections-BetweenAdversarial-Transferability-and-Knowledge-Transferability
code for ICML 2021 paper in which we explore the relationship between adversarial transferability and knowledge transferability.
☆17 Updated 2 years ago
mndu / RNF-Fairness
PyTorch code for the NeurIPS 2021 paper: Fairness via Representation Neutralization
☆10 Updated 3 years ago
google-research / heldout-influence-estimation
☆62 Updated 4 years ago
IBM / model-sanitization
Codes for reproducing the results of the paper "Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness" published at IC…
☆27 Updated 5 years ago
ebagdasa / differential-privacy-vs-fairness
Code for "Differential Privacy Has Disparate Impact on Model Accuracy" NeurIPS'19
☆34 Updated 4 years ago
davidstutz / disentangling-robustness-generalization
CVPR'19 experiments with (on-manifold) adversarial examples.
☆45 Updated 5 years ago