Adversarially Robust Transfer Learning with LWF loss applied to the deep feature representation (penultimate) layer
☆19Feb 9, 2020Updated 6 years ago
Alternatives and similar repositories for RobustTransferLWF
Users that are interested in RobustTransferLWF are comparing it to the libraries listed below
Sorting:
- Official repository for "Stylized Adversarial Training" (TPAMI 2022)☆11Dec 30, 2022Updated 3 years ago
- Code implementing the experiments described in the NeurIPS 2018 paper "With Friends Like These, Who Needs Adversaries?".☆13Sep 11, 2020Updated 5 years ago
- Implementation for What it Thinks is Important is Important: Robustness Transfers through Input Gradients (CVPR 2020 Oral)☆16Mar 24, 2023Updated 2 years ago
- Official Implementation for PlugIn Inversion☆16Oct 23, 2021Updated 4 years ago
- ☆19Jun 26, 2021Updated 4 years ago
- This is the code for semi-supervised robust training (SRT).☆18Mar 24, 2023Updated 2 years ago
- This is the official implementation of ClusTR: Clustering Training for Robustness paper.☆20Oct 20, 2021Updated 4 years ago
- [CVPR'19] Trust Region Based Adversarial Attack☆20Dec 11, 2020Updated 5 years ago
- Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"☆21Dec 10, 2021Updated 4 years ago
- Adversarial Robustness, White-box, Adversarial Attack☆52Jul 6, 2022Updated 3 years ago
- Fluent student-teacher redteaming☆23Jul 25, 2024Updated last year
- Semisupervised learning for adversarial robustness https://arxiv.org/pdf/1905.13736.pdf☆142Mar 30, 2020Updated 5 years ago
- Revisiting Residual Networks for Adversarial Robustness: An Architectural Perspective☆19Jun 7, 2024Updated last year
- ☆26Jul 18, 2019Updated 6 years ago
- Datasets for the paper "Adversarial Examples are not Bugs, They Are Features"☆187Sep 17, 2020Updated 5 years ago
- Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks☆38May 25, 2021Updated 4 years ago
- [NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Z…☆125Dec 29, 2021Updated 4 years ago
- TRADES (TRadeoff-inspired Adversarial DEfense via Surrogate-loss minimization)☆553Mar 30, 2023Updated 2 years ago
- A curated list of 150+ papers and resources on Agentic Security. Based on the survey covering the transition from passive LLMs to autonom…☆28Dec 6, 2025Updated 2 months ago
- https://icml.cc/virtual/2023/poster/24354☆10Aug 15, 2023Updated 2 years ago
- Robust Contrastive Learning Using Negative Samples with Diminished Semantics (NeurIPS 2021)☆39Dec 6, 2021Updated 4 years ago
- ☆13Jul 20, 2023Updated 2 years ago
- This project is the official implementation of "Local and Global Logit Adjustments for Long-Tailed Learning", ICCV 2023☆12Feb 19, 2024Updated 2 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- The first high school physics Olympiad benchmark for evaluating (M)LLMs with step-level grading and human-level comparison.☆26Dec 19, 2025Updated 2 months ago
- Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.☆11Oct 20, 2020Updated 5 years ago
- Simple implementation of text-based Gridworld game. Intended for use with reinforcement learning algorithms.☆15Apr 29, 2018Updated 7 years ago
- code for AAAI accepted paper Similarity Distribution based Membership Inference Attack on Person Re-Identification.☆11Sep 29, 2024Updated last year
- LSA : Layer Sustainability Analysis framework for the analysis of layer vulnerability in a given neural network. LSA can be a helpful too…☆18Mar 22, 2022Updated 3 years ago
- ☆10Mar 10, 2021Updated 4 years ago
- This repository contains the artifacts accompanied by the paper "Fair Preprocessing"☆13Jul 20, 2021Updated 4 years ago
- ☆19May 14, 2025Updated 9 months ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆13Dec 16, 2024Updated last year
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago
- Utility functions for weights and biases (wandb).☆11Sep 17, 2024Updated last year
- content.rdf.u8.gz☆10Dec 15, 2020Updated 5 years ago
- Official Implementation of implicit reference attack☆11Oct 16, 2024Updated last year
- Code for the paper "Overconfidence is a Dangerous Thing: Mitigating Membership Inference Attacks by Enforcing Less Confident Prediction" …☆12Sep 6, 2023Updated 2 years ago
- CAAD 2018 winning submissions☆35Feb 10, 2019Updated 7 years ago