zhangrui4041 / Instruction_Backdoor_Attack
☆17Updated 5 months ago
Alternatives and similar repositories for Instruction_Backdoor_Attack:
Users that are interested in Instruction_Backdoor_Attack are comparing it to the libraries listed below
- Composite Backdoor Attacks Against Large Language Models☆11Updated 10 months ago
- ☆23Updated 4 months ago
- ☆22Updated 3 months ago
- ☆20Updated last year
- [IEEE S&P'24] ODSCAN: Backdoor Scanning for Object Detection Models☆12Updated last month
- Official code for our NDSS paper "Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarkin…☆28Updated 3 months ago
- SaTML'23 paper "Backdoor Attacks on Time Series: A Generative Approach" by Yujing Jiang, Xingjun Ma, Sarah Monazam Erfani, and James Bail…☆17Updated 2 years ago
- Anti-Backdoor learning (NeurIPS 2021)☆81Updated last year
- MASTERKEY is a framework designed to explore and exploit vulnerabilities in large language model chatbots by automating jailbreak attacks…☆18Updated 5 months ago
- A toolbox for backdoor attacks.☆20Updated 2 years ago
- Siren: Byzantine-robust Federated Learning via Proactive Alarming (SoCC '21)☆11Updated 10 months ago
- ☆17Updated 2 years ago
- Official implementation of (CVPR 2022 Oral) Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks.☆26Updated 2 years ago
- Distribution Preserving Backdoor Attack in Self-supervised Learning☆14Updated last year
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models☆115Updated last week
- ☆64Updated 4 years ago
- This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…☆17Updated last year
- Reconstructive Neuron Pruning for Backdoor Defense (ICML 2023)☆34Updated last year
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"☆41Updated 2 years ago
- ☆15Updated last month
- ☆14Updated 2 years ago
- [ICLR24] Official Repo of BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models☆24Updated 6 months ago
- ☆14Updated last year
- ☆24Updated 2 years ago
- ☆79Updated 3 years ago
- ☆26Updated 2 years ago
- Machine Learning & Security Seminar @Purdue University☆25Updated last year
- [NDSS'23] BEAGLE: Forensics of Deep Learning Backdoor Attack for Better Defense☆17Updated 9 months ago
- This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.☆55Updated last month