wagner-group / MarkMyWords
☆25Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for MarkMyWords
- Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models"☆55Updated last year
- Stable Backdoor Purification (NeurIPS 2023 & 2024)☆23Updated this week
- ☆53Updated last year
- Repository for Towards Codable Watermarking for Large Language Models☆29Updated last year
- [ICLR'21] Dataset Inference for Ownership Resolution in Machine Learning☆31Updated 2 years ago
- ☆21Updated 5 months ago
- code release for "Unrolling SGD: Understanding Factors Influencing Machine Unlearning" published at EuroS&P'22☆22Updated 2 years ago
- [ICML 2023] Are Diffusion Models Vulnerable to Membership Inference Attacks?☆30Updated 2 months ago
- Reconstructive Neuron Pruning for Backdoor Defense (ICML 2023)☆28Updated 10 months ago
- ☆23Updated last year
- ☆19Updated last month
- ☆20Updated last year
- Certified Removal from Machine Learning Models☆63Updated 3 years ago
- Camouflage poisoning via machine unlearning☆15Updated last year
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆11Updated last year
- The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on …☆18Updated last year
- ☆38Updated 2 months ago
- Official Implementation of ICLR 2022 paper, ``Adversarial Unlearning of Backdoors via Implicit Hypergradient''☆50Updated last year
- Code for "Label-Consistent Backdoor Attacks"☆49Updated 3 years ago
- ICCV 2021, We find most existing triggers of backdoor attacks in deep learning contain severe artifacts in the frequency domain. This Rep…☆40Updated 2 years ago
- Code for paper: "PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification", IEEE S&P 2024.☆28Updated 3 months ago
- Official implementation of "RelaxLoss: Defending Membership Inference Attacks without Losing Utility" (ICLR 2022)☆45Updated 2 years ago
- Code for Arxiv When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?☆15Updated last month
- ☆16Updated 6 months ago
- Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"☆41Updated 6 months ago
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.☆25Updated 5 months ago
- Code release for DeepJudge (S&P'22)☆51Updated last year
- RAB: Provable Robustness Against Backdoor Attacks☆39Updated last year
- [ICLR2023] Distilling Cognitive Backdoor Patterns within an Image☆31Updated 3 weeks ago
- ☆10Updated 2 years ago