dangne / tmd
[EMNLP'22] Textual Manifold-based Defense Against Natural Language Adversarial Examples
☆11Updated last year
Alternatives and similar repositories for tmd:
Users who are interested in tmd are comparing it to the repositories listed below
- Code for "Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution"☆31Updated last year
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆17Updated last year
- Identification of the Adversary from a Single Adversarial Example (ICML 2023)☆9Updated 7 months ago
- [ICLR 2022 official code] Robust Learning Meets Generative Models: Can Proxy Distributions Improve Adversarial Robustness?☆29Updated 2 years ago
- [CVPR 2024] This repository includes the official implementation of our paper "Revisiting Adversarial Training at Scale"☆18Updated 10 months ago
- AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models☆49Updated 10 months ago
- Code for the paper "Autoregressive Perturbations for Data Poisoning" (NeurIPS 2022)☆19Updated 5 months ago
- This is the code of ICLR 2022 Oral paper 'Non-Transferable Learning: A New Approach for Model Ownership Verification and Applicability Au…☆30Updated last year
- Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"☆46Updated 9 months ago
- [ECCV'24] T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models☆12Updated last month
- Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 2A).☆11Updated last month
- [CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu☆26Updated 5 months ago
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"☆75Updated last year
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆11Updated last year
- Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021☆43Updated 3 years ago
- CVPR2023: Unlearnable Clusters: Towards Label-agnostic Unlearnable Examples☆21Updated last year
- [ECCV-2024] Transferable Targeted Adversarial Attack, CLIP models, Generative adversarial network, Multi-target attacks☆28Updated 6 months ago