yxoh / prompt_leak_usenix2024
☆12Updated 11 months ago
Alternatives and similar repositories for prompt_leak_usenix2024:
Users that are interested in prompt_leak_usenix2024 are comparing it to the libraries listed below
- ☆19Updated 6 months ago
- This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…☆17Updated last year
- The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on …☆18Updated last year
- Reconstructive Neuron Pruning for Backdoor Defense (ICML 2023)☆35Updated last year
- [CCS-LAMPS'24] LLM IP Protection Against Model Merging☆14Updated 6 months ago
- Backdoor Safety Tuning (NeurIPS 2023 & 2024 Spotlight)☆25Updated 4 months ago
- ☆25Updated 2 years ago
- ☆12Updated 3 years ago
- ☆24Updated 2 years ago
- Official Implementation of NIPS 2022 paper Pre-activation Distributions Expose Backdoor Neurons☆14Updated 2 years ago
- Public implementation of the paper "On the Importance of Difficulty Calibration in Membership Inference Attacks".☆16Updated 3 years ago
- This is the repository that introduces research topics related to protecting intellectual property (IP) of AI from a data-centric perspec…☆22Updated last year
- verifying machine unlearning by backdooring☆20Updated 2 years ago
- Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models"☆57Updated last year
- ☆19Updated 11 months ago
- ☆11Updated 2 years ago
- ☆17Updated 3 years ago
- Camouflage poisoning via machine unlearning☆17Updated 2 years ago
- Code for Backdoor Attacks Against Dataset Distillation☆34Updated last year
- This is the implementation for CVPR 2022 Oral paper "Better Trigger Inversion Optimization in Backdoor Scanning."☆24Updated 3 years ago
- ☆31Updated 3 years ago
- Github repo for One-shot Neural Backdoor Erasing via Adversarial Weight Masking (NeurIPS 2022)☆15Updated 2 years ago
- Code for paper: "PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification", IEEE S&P 2024.☆30Updated 8 months ago
- Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107☆17Updated 8 months ago
- Official Implementation of ICLR 2022 paper, ``Adversarial Unlearning of Backdoors via Implicit Hypergradient''☆53Updated 2 years ago
- [ICLR2023] Distilling Cognitive Backdoor Patterns within an Image☆34Updated 5 months ago
- ☆19Updated 2 years ago
- [IEEE S&P 2024] Exploring the Orthogonality and Linearity of Backdoor Attacks☆21Updated 3 months ago
- [ICML 2023] Are Diffusion Models Vulnerable to Membership Inference Attacks?☆34Updated 7 months ago
- Pytorch implementation of backdoor unlearning.☆17Updated 2 years ago