inspire-group / tta_risk

☆14

Alternatives and similar repositories for tta_risk

Users who are interested in tta_risk are comparing it to the repositories listed below.

zhuohuangai / SharpDRO
Code for the CVPR 2023 paper "Robust Generalization against Photon-Limited Corruptions via Worst-Case Sharpness Minimization"
☆13Updated 2 years ago
OPTML-Group / Unlearn-Sparse
[NeurIPS23 (Spotlight)] "Model Sparsity Can Simplify Machine Unlearning" by Jinghan Jia*, Jiancheng Liu*, Parikshit Ram, Yuguang Yao, Gao…
☆81Updated last year
AISafety-HKUST / Backdoor_Safety_Tuning
Backdoor Safety Tuning (NeurIPS 2023 & 2024 Spotlight)
☆26Updated 11 months ago
eth-sri / privacy-inference-multimodal
☆16Updated 8 months ago
conditionWang / Data_Centric_AI_IP_Protection
This is the repository that introduces research topics related to protecting intellectual property (IP) of AI from a data-centric perspec…
☆23Updated 2 years ago
kaiwenzha / contrastive-poisoning
[ICLR 2023, Spotlight] Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning
☆31Updated last year
damon-demon / Black-Box-Defense
Robustify Black-Box Models (ICLR'22 - Spotlight)
☆24Updated 2 years ago
cvlab-columbia / ZSRobust4FoundationModel
☆43Updated 2 years ago
LijieFan / AdvCL
[NeurIPS 2021] “When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?”
☆48Updated 3 years ago
PKU-ML / CFA
☆20Updated 7 months ago
jinghuichen / AWM
GitHub repo for "One-shot Neural Backdoor Erasing via Adversarial Weight Masking" (NeurIPS 2022)
☆15Updated 2 years ago
TomSheng21 / AdaptGuard
ICCV 2023 - AdaptGuard: Defending Against Universal Attacks for Model Adaptation
☆11Updated last year
reds-lab / Meta-Sift
The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on …
☆19Updated 2 years ago
conditionWang / NTL
This is the code of ICLR 2022 Oral paper 'Non-Transferable Learning: A New Approach for Model Ownership Verification and Applicability Au…
☆30Updated 2 years ago
ChaojianYu / Robust-Weight-Perturbation
Implementation of "Robust Weight Perturbation for Adversarial Training" (IJCAI'22).
☆16Updated 3 years ago
boyellow / AdaAD
Code for the paper "Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation" (CVPR 2023).
☆33Updated 2 years ago
OPTML-Group / Unlearn-WorstCase
[ECCV24] "Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning" by Chongyu Fan*, Jiancheng Liu*, Alfred Hero, …
☆23Updated 5 months ago
justincui03 / dc_benchmark
☆87Updated 2 years ago
lafeat / apbench
APBench: A Unified Availability Poisoning Attack and Defenses Benchmark (TMLR 08/2024)
☆36Updated 6 months ago
mo666666 / When-Adversarial-Training-Meets-Vision-Transformers
Official implementation of "When Adversarial Training Meets Vision Transformers: Recipes from Training to Architecture" published at Neur…
☆34Updated last year
meet-cjli / CTRL
An Embarrassingly Simple Backdoor Attack on Self-supervised Learning
☆18Updated last year
Sadcardation / MLLM-Refusal
Repository for the paper "Refusing Safe Prompts for Multi-modal Large Language Models"
☆18Updated last year
inspire-group / DP-RandP
[NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes
☆12Updated 2 years ago
serendipity1122 / Pre-trained-Model-Guided-Fine-Tuning-for-Zero-Shot-Adversarial-Robustness
Code repository for the CVPR 2024 paper "Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness"
☆22Updated last year
ybwang119 / label_recovery
[ICLR 2024] Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks
☆13Updated last year
itsvaibhav01 / Immune
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
☆24Updated 4 months ago
fangjf1 / OpenSafeMLRM
The first toolkit for MLRM safety evaluation, providing a unified interface for mainstream models, datasets, and jailbreaking methods!
☆13Updated 6 months ago
OODRobustBench / OODRobustBench
OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift. ICML 2024 and ICLRW-DMLR 2024
☆23Updated last year
BrachioLab / adversarial_prompting
☆53Updated 2 years ago
bboylyg / RNP
Reconstructive Neuron Pruning for Backdoor Defense (ICML 2023)
☆39Updated last year