thunlp/StyleAttack

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thunlp/StyleAttack)

thunlp / StyleAttack

Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"

☆46

Alternatives and similar repositories for StyleAttack

Users that are interested in StyleAttack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thunlp / ONION
View on GitHub
Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"
☆39Nov 3, 2021Updated 4 years ago
thunlp / BkdAtk-LWS
View on GitHub
Code and data of the ACL 2021 paper "Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution"
☆16Jun 29, 2021Updated 5 years ago
thunlp / OpenBackdoor
View on GitHub
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
☆209Apr 10, 2023Updated 3 years ago
martiansideofthemoon / style-transfer-paraphrase
View on GitHub
Official code and data repository for our EMNLP 2020 long paper "Reformulating Unsupervised Style Transfer as Paraphrase Generation" (htt…
☆240Jun 13, 2022Updated 4 years ago
Jinxhy / AppAIsecurity
View on GitHub
[ICSE-SEIP'21] Robustness of on-device Models: AdversarialAttack to Deep Learning Models on Android Apps
☆15Jun 2, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
yujingmarkjiang / Time_Series_Backdoor_Attack
View on GitHub
SaTML'23 paper "Backdoor Attacks on Time Series: A Generative Approach" by Yujing Jiang, Xingjun Ma, Sarah Monazam Erfani, and James Bail…
☆21Feb 5, 2023Updated 3 years ago
lancopku / RAP
View on GitHub
Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)
☆25Oct 21, 2021Updated 4 years ago
grasses / PoisonPrompt
View on GitHub
Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107
☆21Aug 10, 2024Updated last year
SimengSun / ChapterBreak
View on GitHub
☆12Jun 5, 2024Updated 2 years ago
UNHSAILLab / working-memory-attack-on-llms
View on GitHub
Working Memory Attack on LLMs
☆18May 27, 2025Updated last year
abhinavkashyap / dct
View on GitHub
Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"
☆16Jan 19, 2024Updated 2 years ago
PurduePAML / PICCOLO
View on GitHub
☆26Dec 1, 2022Updated 3 years ago
hychaochao / Chat-Models-Backdoor-Attacking
View on GitHub
Code for the paper "Exploring Backdoor Vulnerabilities of Chat Models"
☆19Apr 13, 2024Updated 2 years ago
sail-sg / AnyDoor
View on GitHub
AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models
☆61Apr 8, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OSU-NLP-Group / SELM
View on GitHub
Symmetric Encryption with Language Models
☆13Jun 13, 2023Updated 3 years ago
centerforaisafety / tdc2023-starter-kit
View on GitHub
This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
☆92May 19, 2024Updated 2 years ago
zhangrui4041 / Instruction_Backdoor_Attack
View on GitHub
☆26Aug 21, 2024Updated last year
papersPapers / BadPrompt
View on GitHub
Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts"
☆41Jul 8, 2024Updated 2 years ago
PlusLabNLP / AESOP
View on GitHub
Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021)
☆26Jan 17, 2022Updated 4 years ago
lifan-yuan / OOD_NLP
View on GitHub
[NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…
☆37Jun 8, 2023Updated 3 years ago
thunlp / Sememe-SC
View on GitHub
Source code and data for ACL 2019 paper "Modeling Semantic Compositionality with Sememe Knowledge"
☆34Jul 30, 2020Updated 5 years ago
JHL-HUST / FGPM
View on GitHub
Adversarial Training with Fast Gradient Projection Method against Synonym Substitution based Text Attacks
☆24Dec 11, 2020Updated 5 years ago
ain-soph / trojanzoo
View on GitHub
TrojanZoo provides a universal pytorch platform to conduct security researches (especially backdoor attacks/defenses) of image classifica…
☆303Aug 25, 2025Updated 10 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
limenlp / safer-instruct
View on GitHub
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Feb 22, 2024Updated 2 years ago
mmalekzadeh / honest-but-curious-nets
View on GitHub
Honest-but-Curious Nets: Sensitive Attributes of Private Inputs Can Be Secretly Coded into the Classifiers' Outputs (ACM CCS'21)
☆17Jan 11, 2023Updated 3 years ago
shtoshni / g2p
View on GitHub
Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models
☆15Feb 20, 2019Updated 7 years ago
SewoongLab / spectre-defense
View on GitHub
Defending Against Backdoor Attacks Using Robust Covariance Estimation
☆22Jul 12, 2021Updated 5 years ago
claws-lab / casper
View on GitHub
Code and data for the ACM CIKM 2022 paper "Rank List Sensitivity of Recommender Systems to Interaction Perturbations"
☆10Aug 16, 2022Updated 3 years ago
CarlosGomes98 / ECG-Classification
View on GitHub
Machine Learning for Healthcare
☆10Mar 28, 2020Updated 6 years ago
tmlr-group / DeepInception
View on GitHub
[arXiv:2311.03191] "DeepInception: Hypnotize Large Language Model to Be Jailbreaker"
☆176Feb 20, 2024Updated 2 years ago
andyzoujm / breaking-llama-guard
View on GitHub
Code to break Llama Guard
☆32Dec 7, 2023Updated 2 years ago
yuezunli / ISSBA
View on GitHub
Invisible Backdoor Attack with Sample-Specific Triggers
☆106Aug 2, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
bboylyg / RNP
View on GitHub
Reconstructive Neuron Pruning for Backdoor Defense (ICML 2023)
☆40Dec 24, 2023Updated 2 years ago
reza321 / T-Miner
View on GitHub
☆19Mar 9, 2024Updated 2 years ago
thu-coai / JailbreakDefense_GoalPriority
View on GitHub
[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
☆29Jul 9, 2024Updated 2 years ago
lancopku / Embedding-Poisoning
View on GitHub
Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…
☆45Jul 26, 2021Updated 4 years ago
BiDAlab / ECGXtractor
View on GitHub
☆14Oct 11, 2024Updated last year
wxhdf / MRM
View on GitHub
A Multi-Resolution Mutual Learning Network for Multi-Label ECG Classification
☆13Mar 14, 2025Updated last year
ZiangYan / pda.pytorch
View on GitHub
Implementation of our ICLR 2021 paper: Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples.
☆11Mar 9, 2021Updated 5 years ago