meng-wenlong / LMSanitatorLinks

☆22

Alternatives and similar repositories for LMSanitator

Users that are interested in LMSanitator are comparing it to the libraries listed below

Sorting:

PurduePAML / DBS
☆18Updated 2 years ago
zhangrui4041 / Instruction_Backdoor_Attack
☆25Updated 11 months ago
shaoshuo-ss / EaaW
[NDSS 2025] Official code for our paper "Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Wate…
☆39Updated 9 months ago
MiracleHH / CBA
Composite Backdoor Attacks Against Large Language Models
☆16Updated last year
Gwinhen / BackdoorVault
A toolbox for backdoor attacks.
☆22Updated 2 years ago
mengtong0110 / InferDPT
☆31Updated 3 months ago
RU-System-Software-and-Security / FeatureRE
☆27Updated 2 years ago
reds-lab / ASSET
This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…
☆19Updated 2 years ago
SCLBD / DBD
☆31Updated 3 years ago
grasses / PoisonPrompt
Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107
☆17Updated 11 months ago
Gwinhen / PixelBackdoor
This is the implementation for CVPR 2022 Oral paper "Better Trigger Inversion Optimization in Backdoor Scanning."
☆24Updated 3 years ago
bangawayoo / nlp-watermarking
Robust natural language watermarking using invariant features
☆26Updated last year
lancopku / codable-watermarking-for-llm
Repository for Towards Codable Watermarking for Large Language Models
☆38Updated last year
Megum1 / ODSCAN
[IEEE S&P'24] ODSCAN: Backdoor Scanning for Object Detection Models
☆17Updated 7 months ago
Lyz1213 / BadEdit
☆32Updated 9 months ago
bboylyg / RNP
Reconstructive Neuron Pruning for Backdoor Defense (ICML 2023)
☆38Updated last year
DeepLearningSecurityGroup / Cyber_Security_Reading_Group
☆11Updated 2 weeks ago
AI-secure / Meta-Nerual-Trojan-Detection
☆66Updated 4 years ago
lvpeizhuo / MEA-Defender
This is the source code for MEA-Defender. Our paper is accepted by the IEEE Symposium on Security and Privacy (S&P) 2024.
☆25Updated last year
bboylyg / ABL
Anti-Backdoor learning (NeurIPS 2021)
☆82Updated 2 years ago
Gwinhen / DRUPE
Distribution Preserving Backdoor Attack in Self-supervised Learning
☆16Updated last year
thunlp / HiddenKiller
Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"
☆43Updated 2 years ago
JunfengGo / SCALE-UP
☆24Updated last year
RU-System-Software-and-Security / BppAttack
☆21Updated 2 years ago
csdongxian / ANP_backdoor
Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models"
☆58Updated 2 years ago
T0hsakar1n / RAPID
Source code and scripts for the paper "Is Difficulty Calibration All We Need? Towards More Practical Membership Inference Attacks"
☆18Updated 7 months ago
YiZeng623 / I-BAU
Official Implementation of ICLR 2022 paper, ``Adversarial Unlearning of Backdoors via Implicit Hypergradient''
☆53Updated 2 years ago
KaiyuanZh / OrthogLinearBackdoor
[Oakland 2024] Exploring the Orthogonality and Linearity of Backdoor Attacks
☆27Updated 3 months ago
wanlunsec / Beatrix
☆24Updated 2 years ago
Huiying-Li / Latent-Backdoor
This is the documentation of the Tensorflow/Keras implementation of Latent Backdoor Attacks. Please see the paper for details Latent Back…
☆19Updated 3 years ago