machinelearning4health / TextHoaxerLinks
Implementation Code of TextHoaxer
☆14Updated 2 years ago
Alternatives and similar repositories for TextHoaxer
Users that are interested in TextHoaxer are comparing it to the libraries listed below
Sorting:
- Natural Language Attacks in a Hard Label Black Box Setting.☆47Updated 4 years ago
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"☆43Updated 2 years ago
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆33Updated 3 years ago
- Hidden backdoor attack on NLP systems☆47Updated 3 years ago
- An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)☆183Updated 2 years ago
- ☆11Updated 5 years ago
- Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"☆42Updated 2 years ago
- Paper list of Adversarial Examples☆48Updated last year
- A list of recent adversarial attack and defense papers (including those on large language models)☆40Updated last week
- ☆19Updated last year
- ☆14Updated last year
- ☆28Updated 7 months ago
- ☆23Updated 9 months ago
- [Findings of ACL 2023] Bridge the Gap Between CV and NLP! A Optimization-based Textual Adversarial Attack Framework.☆13Updated last year
- The most comprehensive and accurate LLM jailbreak attack benchmark by far☆19Updated 2 months ago
- ☆18Updated 3 years ago
- Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT☆196Updated 4 years ago
- This is the official Gtihub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Lang…☆17Updated 11 months ago
- SAFER: A Structure-free Approach For cErtified Robustness to Adversarial Word Substitutions (ACL 2020)☆31Updated 4 years ago
- Anti-Backdoor learning (NeurIPS 2021)☆81Updated last year
- ☆15Updated 8 months ago
- ☆11Updated 3 years ago
- ☆27Updated 2 years ago
- A toolbox for backdoor attacks.☆22Updated 2 years ago
- TrojanLM: Trojaning Language Models for Fun and Profit☆16Updated 3 years ago
- 🔥🔥🔥 Detecting hidden backdoors in Large Language Models with only black-box access☆27Updated 6 months ago
- An Open-Source Package for Textual Adversarial Attack.☆731Updated last year
- Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization"☆88Updated 4 years ago
- Machine Learning & Security Seminar @Purdue University☆25Updated 2 years ago
- Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…☆40Updated 3 years ago