kangjie-chen / BadPre
☆11Updated 3 years ago
Alternatives and similar repositories for BadPre:
Users that are interested in BadPre are comparing it to the libraries listed below
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"☆41Updated 2 years ago
- Distribution Preserving Backdoor Attack in Self-supervised Learning☆14Updated last year
- A toolbox for backdoor attacks.☆21Updated 2 years ago
- ☆19Updated 2 years ago
- This is the implementation for CVPR 2022 Oral paper "Better Trigger Inversion Optimization in Backdoor Scanning."☆24Updated 2 years ago
- Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models"☆57Updated last year
- ☆26Updated 2 years ago
- ☆24Updated 4 months ago
- Anti-Backdoor learning (NeurIPS 2021)☆82Updated last year
- A minimal PyTorch implementation of Label-Consistent Backdoor Attacks☆30Updated 4 years ago
- [IEEE S&P'24] ODSCAN: Backdoor Scanning for Object Detection Models☆13Updated 2 months ago
- Official Implementation of ICLR 2022 paper, ``Adversarial Unlearning of Backdoors via Implicit Hypergradient''☆54Updated 2 years ago
- [AAAI'21] Deep Feature Space Trojan Attack of Neural Networks by Controlled Detoxification☆28Updated 2 months ago
- ☆14Updated last year
- [IEEE S&P 2024] Exploring the Orthogonality and Linearity of Backdoor Attacks☆21Updated 2 months ago
- Official implementation of (CVPR 2022 Oral) Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks.☆26Updated 2 years ago
- ☆14Updated 2 years ago
- This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…☆17Updated last year
- Repository for Towards Codable Watermarking for Large Language Models☆35Updated last year
- ☆20Updated last year
- TrojanLM: Trojaning Language Models for Fun and Profit☆16Updated 3 years ago
- ☆79Updated 3 years ago
- ☆42Updated 2 months ago
- Code for "Label-Consistent Backdoor Attacks"☆53Updated 4 years ago
- This is the documentation of the Tensorflow/Keras implementation of Latent Backdoor Attacks. Please see the paper for details Latent Back…☆19Updated 3 years ago
- 复现了下Neural Cleanse这篇论文,真的是简单而有效,发在了okaland☆30Updated 3 years ago
- [NDSS 2025] Official code for our paper "Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Wate…☆30Updated 4 months ago
- This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.☆55Updated 2 months ago
- This is for releasing the source code of the ACSAC paper "STRIP: A Defence Against Trojan Attacks on Deep Neural Networks"☆54Updated 4 months ago
- Code and data of the ACL 2021 paper "Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution"☆16Updated 3 years ago