machinelearning4health / TextHoaxer
Implementation Code of TextHoaxer
☆14 Updated 2 years ago
Alternatives and similar repositories for TextHoaxer:
Users who are interested in TextHoaxer are comparing it to the repositories listed below
- Natural Language Attacks in a Hard Label Black Box Setting. ☆47 Updated 3 years ago
- An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight) ☆181 Updated 2 years ago
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger" ☆43 Updated 2 years ago
- [Findings of ACL 2023] Bridge the Gap Between CV and NLP! An Optimization-based Textual Adversarial Attack Framework. ☆13 Updated last year
- Hidden backdoor attack on NLP systems ☆47 Updated 3 years ago
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks" ☆33 Updated 3 years ago
- Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT ☆196 Updated 4 years ago
- The most comprehensive and accurate LLM jailbreak attack benchmark by far ☆19 Updated last month
- ☆10 Updated 2 years ago
- A list of recent adversarial attack and defense papers (including those on large language models) ☆37 Updated last week
- Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers" ☆108 Updated 2 years ago
- ☆11 Updated 3 years ago
- Paper list of Adversarial Examples ☆46 Updated last year
- A reproduction of the Neural Cleanse paper, which is simple yet truly effective and was published at Oakland ☆30 Updated 3 years ago
- ☆27 Updated 2 years ago
- ☆31 Updated 7 months ago
- ☆21 Updated 8 months ago
- Code for "Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution" ☆31 Updated last year
- Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization" ☆88 Updated 4 years ago
- ☆26 Updated 6 months ago
- [NDSS 2025] "CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models" ☆12 Updated 4 months ago
- Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021) ☆24 Updated 3 years ago
- Anti-Backdoor learning (NeurIPS 2021) ☆81 Updated last year
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble ☆18 Updated last year
- ☆79 Updated last year
- ☆144 Updated 7 months ago
- Bad Characters: Imperceptible NLP Attacks ☆34 Updated last year
- ☆81 Updated 3 years ago
- ☆14 Updated last year
- A Query Efficient Natural Language Attack in a Black Box Setting ☆16 Updated 3 years ago