machinelearning4health / TextHoaxer
Implementation Code of TextHoaxer
☆15 Updated 3 years ago
Alternatives and similar repositories for TextHoaxer
Users who are interested in TextHoaxer are comparing it to the repositories listed below.
- Natural Language Attacks in a Hard Label Black Box Setting. ☆50 Updated 4 years ago
- An Open-Source Package for Textual Adversarial Attack. ☆768 Updated 2 years ago
- An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight) ☆200 Updated 2 years ago
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger" ☆43 Updated 3 years ago
- Must-read Papers on Textual Adversarial Attack and Defense ☆1,576 Updated 8 months ago
- TrojanZoo provides a universal pytorch platform to conduct security researches (especially backdoor attacks/defenses) of image classifica… ☆302 Updated 5 months ago
- [Findings of ACL 2023] Bridge the Gap Between CV and NLP! An Optimization-based Textual Adversarial Attack Framework. ☆14 Updated 2 years ago
- ☆19 Updated last year
- Hidden backdoor attack on NLP systems ☆47 Updated 4 years ago
- A list of recent adversarial attack and defense papers (including those on large language models) ☆46 Updated 2 weeks ago
- ☆151 Updated last year
- A curated list of papers & resources on backdoor attacks and defenses in deep learning. ☆235 Updated last year
- ☆37 Updated last year
- ☆26 Updated last year
- ☆11 Updated 3 years ago
- ☆11 Updated 5 years ago
- SAFER: A Structure-free Approach For cErtified Robustness to Adversarial Word Substitutions (ACL 2020) ☆31 Updated 5 years ago
- Machine Learning & Security Seminar @Purdue University ☆25 Updated 2 years ago
- Bad Characters: Imperceptible NLP Attacks ☆35 Updated last year
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models ☆230 Updated 2 weeks ago
- Paper list of Adversarial Examples ☆52 Updated 2 years ago
- ☆15 Updated 2 years ago
- A Model for Natural Language Attack on Text Classification and Inference ☆527 Updated 3 years ago
- The open-sourced Python toolbox for backdoor attacks and defenses. ☆641 Updated 4 months ago
- TrojanLM: Trojaning Language Models for Fun and Profit ☆16 Updated 4 years ago
- [CIKM 2024] Trojan Activation Attack: Attack Large Language Models using Activation Steering for Safety-Alignment. ☆29 Updated last year
- White-box Fairness Testing through Adversarial Sampling ☆13 Updated 4 years ago
- Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection" ☆27 Updated last year
- [ICLR24] Official Repo of BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models ☆48 Updated last year
- Composite Backdoor Attacks Against Large Language Models ☆22 Updated last year