IBM / sensitive-subspace-robustness
☆13 Updated 3 years ago
Alternatives and similar repositories for sensitive-subspace-robustness:
Users who are interested in sensitive-subspace-robustness are comparing it to the libraries listed below:
- ☆17 Updated 4 years ago
- ☆50 Updated last year
- Implementation of Adversarial Debiasing in PyTorch to address Gender Bias ☆30 Updated 4 years ago
- Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020) ☆15 Updated 4 years ago
- ☆18 Updated 3 years ago
- Code for "Imitation Attacks and Defenses for Black-box Machine Translation Systems" ☆36 Updated 4 years ago
- Learning Certified Individually Fair Representations ☆24 Updated 4 years ago
- ☆9 Updated 4 years ago
- TextHide: Tackling Data Privacy in Language Understanding Tasks ☆31 Updated 3 years ago
- ☆38 Updated 3 years ago
- [Preprint] On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shaping ☆10 Updated 4 years ago
- ☆86 Updated last year
- ☆15 Updated 4 years ago
- ☆62 Updated 3 years ago
- OOD Generalization and Detection (ACL 2020) ☆60 Updated 4 years ago
- Codes for reproducing the results of the paper "Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness" published at IC… ☆26 Updated 4 years ago
- Code for "Neuron Shapley: Discovering the Responsible Neurons" ☆23 Updated 8 months ago
- Code for the paper "Weight Poisoning Attacks on Pre-trained Models" (ACL 2020) ☆140 Updated 3 years ago
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22) ☆19 Updated 2 years ago
- ☆32 Updated 7 years ago
- ☆22 Updated 5 years ago
- Code for "Differential Privacy Has Disparate Impact on Model Accuracy" NeurIPS'19 ☆35 Updated 3 years ago
- Codes for reproducing the contrastive explanation in “Explanations based on the Missing: Towards Contrastive Explanations with Pertinent… ☆54 Updated 6 years ago
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS, 2021). ☆22 Updated 2 years ago
- Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020) ☆28 Updated 4 years ago
- Implementation of Minimax Pareto Fairness framework ☆21 Updated 4 years ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022] ☆30 Updated 2 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781) ☆13 Updated 2 years ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases". ☆16 Updated 4 years ago
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper) ☆50 Updated 3 years ago