haowang02 / TransTrojLinks
[WWW '25] Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability
☆17Updated 7 months ago
Alternatives and similar repositories for TransTroj
Users that are interested in TransTroj are comparing it to the libraries listed below
Sorting:
- WaNet - Imperceptible Warping-based Backdoor Attack (ICLR 2021)☆133Updated last year
- A curated list of papers & resources on backdoor attacks and defenses in deep learning.☆231Updated last year
- 复现了下Neural Cleanse这篇论文,真的是简单而有效,发在了okaland☆33Updated 4 years ago
- A list of recent papers about adversarial learning☆277Updated this week
- [NDSS 2025] Official code for our paper "Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Wate…☆45Updated last year
- ☆16Updated last year
- A curated list of papers & resources linked to data poisoning, backdoor attacks and defenses against them (no longer maintained)☆282Updated last year
- This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.☆63Updated last year
- Official Code for ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users (NeurIPS 2024)☆23Updated last year
- ☆26Updated last year
- ☆37Updated last year
- ☆17Updated last year
- Composite Backdoor Attacks Against Large Language Models☆21Updated last year
- ☆27Updated 2 years ago
- ☆21Updated 3 years ago
- ☆55Updated last year
- A toolbox for backdoor attacks.☆23Updated 2 years ago
- Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks (IEEE S&P 2024)☆34Updated 6 months ago
- APBench: A Unified Availability Poisoning Attack and Defenses Benchmark (TMLR 08/2024)☆46Updated 8 months ago
- Code repository for the paper --- [USENIX Security 2023] Towards A Proactive ML Approach for Detecting Backdoor Poison Samples☆30Updated 2 years ago
- Revisiting Transferable Adversarial Images (TPAMI 2025)☆139Updated 4 months ago
- Source code and scripts for the paper "Is Difficulty Calibration All We Need? Towards More Practical Membership Inference Attacks"☆20Updated last year
- [NDSS'25] The official implementation of safety misalignment.☆17Updated last year
- [ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images☆42Updated last year
- [NeurIPS 2025] BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models☆262Updated 2 months ago
- ☆32Updated last year
- Anti-Backdoor learning (NeurIPS 2021)☆83Updated 2 years ago
- ☆75Updated last year
- Invisible Backdoor Attack with Sample-Specific Triggers☆103Updated 3 years ago
- ☆71Updated 7 months ago