EleutherAI / radioactive-labLinks
Adapting the "Radioactive Data" paper to work for text models
☆9Updated 4 years ago
Alternatives and similar repositories for radioactive-lab
Users that are interested in radioactive-lab are comparing it to the libraries listed below
Sorting:
- Provably defending pretrained classifiers including the Azure, Google, AWS, and Clarifai APIs☆97Updated 4 years ago
- Convex Layerwise Adversarial Training (COLT)☆28Updated 4 years ago
- Code and data for the ICLR 2021 paper "Perceptual Adversarial Robustness: Defense Against Unseen Threat Models".☆55Updated 3 years ago
- ConvexPolytopePosioning☆35Updated 5 years ago
- Implementation of Wasserstein adversarial attacks.☆23Updated 4 years ago
- Source code for the paper "Exploiting Excessive Invariance caused by Norm-Bounded Adversarial Robustness"☆25Updated 5 years ago
- Code for the paper "Weight Poisoning Attacks on Pre-trained Models" (ACL 2020)☆141Updated 3 years ago
- CVPR 2021 Official repository for the Data-Free Model Extraction paper. https://arxiv.org/abs/2011.14779☆72Updated last year
- Code for the unrestricted adversarial examples paper (NeurIPS 2018)☆64Updated 5 years ago
- ☆125Updated 3 years ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022]☆30Updated 3 years ago
- ☆65Updated last year
- Code for Stability Training with Noise (STN)☆22Updated 4 years ago
- Witches' Brew: Industrial Scale Data Poisoning via Gradient Matching☆102Updated 9 months ago
- Black-Box Ripper: Copying black-box models using generative evolutionary algorithms - NIPS 2020 - Official Implementation☆28Updated 4 years ago
- RayS: A Ray Searching Method for Hard-label Adversarial Attack (KDD2020)☆56Updated 4 years ago
- ☆16Updated 2 years ago
- [ICML'20] Multi Steepest Descent (MSD) for robustness against the union of multiple perturbation models.☆26Updated 10 months ago
- Code for our NeurIPS 2019 *spotlight* "Provably Robust Deep Learning via Adversarially Trained Smoothed Classifiers"☆225Updated 5 years ago
- [ICLR'21] Dataset Inference for Ownership Resolution in Machine Learning☆32Updated 2 years ago
- Investigating the robustness of state-of-the-art CNN architectures to simple spatial transformations.☆49Updated 5 years ago
- Code for paper "Robustness of Bayesian Neural Networks to Gradient-Based Attacks"☆17Updated last year
- ☆25Updated 6 years ago
- ☆87Updated 10 months ago
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆27Updated 6 months ago
- Repository for Certified Defenses for Adversarial Patch ICLR-2020☆32Updated 4 years ago
- ☆53Updated 2 years ago
- [CVPR 2022] "Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free" by Tianlong Chen*, Zhenyu Zhang*, Yihua Zhang*, Shiyu C…☆26Updated 2 years ago
- Tensorflow implementation of Meta Adversarial Training for Adversarial Patch Attacks on Tiny ImageNet.☆25Updated 4 years ago
- Feature Scattering Adversarial Training (NeurIPS19)☆73Updated last year