machinelearning4health / TextHoaxer
Implementation Code of TextHoaxer
☆14Updated 2 years ago
Alternatives and similar repositories for TextHoaxer:
Users that are interested in TextHoaxer are comparing it to the libraries listed below
- Natural Language Attacks in a Hard Label Black Box Setting.☆47Updated 3 years ago
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆33Updated 3 years ago
- [Findings of ACL 2023] Bridge the Gap Between CV and NLP! A Optimization-based Textual Adversarial Attack Framework.☆13Updated last year
- An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)☆175Updated 2 years ago
- Hidden backdoor attack on NLP systems☆47Updated 3 years ago
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"☆42Updated 2 years ago
- Paper list of Adversarial Examples☆46Updated last year
- Anti-Backdoor learning (NeurIPS 2021)☆81Updated last year
- Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"☆42Updated 2 years ago
- MASTERKEY is a framework designed to explore and exploit vulnerabilities in large language model chatbots by automating jailbreak attacks…☆20Updated 7 months ago
- ☆13Updated last year
- An unofficial implementation of AutoDAN attack on LLMs (arXiv:2310.15140)☆37Updated last year
- ☆26Updated 2 years ago
- A curated list of trustworthy Generative AI papers. Daily updating...☆71Updated 7 months ago
- ☆19Updated 10 months ago
- ☆14Updated 11 months ago
- ☆144Updated 6 months ago
- ☆80Updated last year
- This is the official Gtihub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Lang…☆16Updated 9 months ago
- Code repository for the paper --- [USENIX Security 2023] Towards A Proactive ML Approach for Detecting Backdoor Poison Samples☆25Updated last year
- A list of recent adversarial attack and defense papers (including those on large language models)☆37Updated this week
- ☆25Updated 6 months ago
- ☆11Updated 3 years ago
- Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)☆24Updated 3 years ago
- codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"☆31Updated last year
- The most comprehensive and accurate LLM jailbreak attack benchmark by far☆19Updated 3 weeks ago
- SAFER: A Structure-free Approach For cErtified Robustness to Adversarial Word Substitutions (ACL 2020)☆29Updated 4 years ago
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…☆24Updated last year
- ☆18Updated 2 years ago
- [NDSS 2025] "CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models"☆12Updated 4 months ago