shenyizg / NewAdversarialAttackPaper
A list of recent adversarial attack and defense papers (including those on large language models)
☆31 Updated this week
Alternatives and similar repositories for NewAdversarialAttackPaper:
Users interested in NewAdversarialAttackPaper also compare it to the repositories listed below.
- ☆26 Updated 2 years ago
- A list of recent papers about adversarial learning ☆113 Updated this week
- ☆92 Updated last year
- Official repository of "VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models" (NeurIPS 2… ☆45 Updated 3 months ago
- ☆24 Updated 2 years ago
- A curated list of papers on the transferability of adversarial examples ☆60 Updated 7 months ago
- ☆31 Updated 2 years ago
- ☆23 Updated last year
- [ICLR24] Official Repo of BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models ☆26 Updated 6 months ago
- ☆38 Updated last month
- ☆30 Updated 7 months ago
- A list of papers in NeurIPS 2022 related to adversarial attack and defense / AI security. ☆71 Updated 2 years ago
- Code for the paper "PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification", IEEE S&P 2024. ☆30 Updated 6 months ago
- [S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models ☆18 Updated this week
- Code repository for the paper "Revisiting the Assumption of Latent Separability for Backdoor Defenses" (ICLR 2023) ☆39 Updated last year
- An Embarrassingly Simple Backdoor Attack on Self-supervised Learning ☆17 Updated last year
- Source code for Data-free Backdoor. The paper was accepted at the 32nd USENIX Security Symposium (USENIX Security 2023). ☆31 Updated last year
- A curated list of trustworthy Generative AI papers. Updated daily. ☆69 Updated 5 months ago
- ☆65 Updated 6 months ago
- Implementation of BadCLIP (https://arxiv.org/pdf/2311.16194.pdf) ☆18 Updated 10 months ago
- ☆79 Updated 3 years ago
- Code repository for the submission "Understanding the Dark Side of LLMs’ Intrinsic Self-Correction". ☆55 Updated 2 months ago
- Code repository for the paper "Towards A Proactive ML Approach for Detecting Backdoor Poison Samples" (USENIX Security 2023) ☆25 Updated last year
- Source code for the ACSAC paper "STRIP: A Defence Against Trojan Attacks on Deep Neural Networks" ☆54 Updated 3 months ago
- Invisible Backdoor Attack with Sample-Specific Triggers ☆94 Updated 2 years ago
- ☆20 Updated last year
- ☆18 Updated 2 years ago
- Composite Backdoor Attacks Against Large Language Models ☆11 Updated 10 months ago
- Official implementation of the ICLR 2022 paper "Adversarial Unlearning of Backdoors via Implicit Hypergradient" ☆54 Updated 2 years ago