jianshuod / TBA
Official code for the ICCV2023 paper ``One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training''
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for TBA
- The implementatin of our ICLR 2021 work: Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits☆18Updated 3 years ago
- official implementation of Towards Robust Model Watermark via Reducing Parametric Vulnerability☆12Updated 5 months ago
- ☆10Updated 8 months ago
- Data-Efficient Backdoor Attacks☆18Updated 2 years ago
- PDM-based Purifier☆13Updated this week
- ☆29Updated 2 years ago
- Code for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" in NMI.☆43Updated 11 months ago
- [ICML 2023] "NeRFool: Uncovering the Vulnerability of Generalizable Neural Radiance Fields against Adversarial Perturbations" by Yonggan …☆14Updated 8 months ago
- ☆40Updated last year
- [NeurIPS'22] Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork. Haotao Wang, Junyuan Hong,…☆13Updated 11 months ago
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"☆31Updated 6 months ago
- AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models☆43Updated 7 months ago
- ☆20Updated last year
- Official Implementation of NIPS 2022 paper Pre-activation Distributions Expose Backdoor Neurons☆14Updated last year
- Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts"☆35Updated 4 months ago
- ☆27Updated 9 months ago
- This is the repository that introduces research topics related to protecting intellectual property (IP) of AI from a data-centric perspec…☆22Updated last year
- Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"☆27Updated last month
- [CVPR 2022] "Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free" by Tianlong Chen*, Zhenyu Zhang*, Yihua Zhang*, Shiyu C…☆25Updated 2 years ago
- Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"☆38Updated 2 weeks ago
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆12Updated last year
- [Arxiv 2024] Adversarial attacks on multimodal agents☆37Updated 4 months ago
- [NeurIPS 2022] "Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets" by Ruisi Cai*, Zhenyu Zh…☆18Updated 2 years ago
- Code for Voice Jailbreak Attacks Against GPT-4o.☆25Updated 5 months ago
- ☆25Updated last year
- ☆17Updated last week
- [ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun…☆37Updated 2 years ago
- ☆20Updated 5 months ago
- Code for "Adversarial Illusions in Multi-Modal Embeddings"☆16Updated 3 months ago
- SEAT☆19Updated last year