leix28/prompt-universal-vulnerability

Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm", published in Findings of NAACL 2022

☆32

Alternatives and similar repositories for prompt-universal-vulnerability

Users who are interested in prompt-universal-vulnerability are comparing it to the repositories listed below.

thunlp / OpenBackdoor
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
☆209 · Apr 10, 2023 · Updated 3 years ago
MiracleHH / CBA
Composite Backdoor Attacks Against Large Language Models
☆25 · Apr 12, 2024 · Updated 2 years ago
zhangrui4041 / Instruction_Backdoor_Attack
☆25 · Aug 21, 2024 · Updated last year
Hsuan-Tung / universal_attack_natural_trigger
Natural Universal Trigger Search (NUTS)
☆21 · Apr 17, 2021 · Updated 5 years ago
AI-secure / TextGuard
TextGuard: Provable Defense against Backdoor Attacks on Text Classification
☆15 · Nov 7, 2023 · Updated 2 years ago
JHL-HUST / PWWS
Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency
☆84 · Mar 24, 2023 · Updated 3 years ago
PurduePAML / PICCOLO
☆26 · Dec 1, 2022 · Updated 3 years ago
zhaohan-xi / GraphBackdoor
☆57 · Oct 5, 2022 · Updated 3 years ago
thunlp / HiddenKiller
Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"
☆45 · Sep 11, 2022 · Updated 3 years ago
illidanlab / FADE
[KDD2021] Federated Adversarial Debiasing for Fair and Transferable Representations: Optimize an adversarial domain-adaptation objective …
☆26 · Feb 23, 2023 · Updated 3 years ago
minicheshire / Robust-Prefix-Tuning
Code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification
☆27 · Mar 21, 2022 · Updated 4 years ago
tianshuocong / SSLGuard
[CCS'22] SSLGuard: A Watermarking Scheme for Self-supervised Learning Pre-trained Encoders
☆18 · Jul 12, 2022 · Updated 4 years ago
mlapistudy / ICSE2021_421
Artifact for the paper "Are Machine Learning Cloud APIs Used Correctly? (#421)" in ICSE 2021
☆16 · Feb 27, 2021 · Updated 5 years ago
microsoft / ReACC
Source code for the paper "ReACC: A Retrieval-Augmented Code Completion Framework"
☆67 · Apr 18, 2022 · Updated 4 years ago
lancopku / agent-backdoor-attacks
Code and data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]
☆116 · Sep 27, 2024 · Updated last year
joey1993 / bert-defender
Code for the paper "Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification" (EMNLP 2019)
☆15 · Feb 25, 2020 · Updated 6 years ago
allenai / natural-perturbations
Natural Perturbation for Robust Question Answering
☆12 · Apr 7, 2020 · Updated 6 years ago
LinyangLee / BERT-Attack
Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT
☆207 · Sep 22, 2020 · Updated 5 years ago
BeyonderXX / ShadowAlignment
Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models
☆35 · Oct 19, 2023 · Updated 2 years ago
robinjia / certified-word-sub
Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)
☆38 · Dec 30, 2019 · Updated 6 years ago
hsajjad / Interpretability-Tutorial-NAACL2021
☆24 · Jun 7, 2021 · Updated 5 years ago
tapilab / aaai-2021-counterfactuals
☆13 · Jul 6, 2021 · Updated 5 years ago
miyyer / scpn
Syntactically controlled paraphrase networks
☆168 · Dec 30, 2018 · Updated 7 years ago
crazyofapple / AT-BMC
AAAI 2022 paper: Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extraction
☆17 · Dec 23, 2021 · Updated 4 years ago
eth-sri / bayes-framework-leakage
☆10 · Apr 21, 2022 · Updated 4 years ago
grasses / PoisonPrompt
Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107
☆21 · Aug 10, 2024 · Updated last year
tapilab / emnlp-2020-spurious
☆17 · Mar 22, 2021 · Updated 5 years ago
JHL-HUST / FGPM
Adversarial Training with Fast Gradient Projection Method against Synonym Substitution based Text Attacks
☆24 · Dec 11, 2020 · Updated 5 years ago
AI-secure / SemAttack
[NAACL 2022] "SemAttack: Natural Textual Attacks via Different Semantic Spaces" by Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li
☆21 · Jun 11, 2022 · Updated 4 years ago
fartashf / under_convnet
Caffe code for the paper "Adversarial Manipulation of Deep Representations"
☆17 · Nov 6, 2017 · Updated 8 years ago
UCDvision / PatchSearch
Code for the CVPR '23 paper, "Defending Against Patch-based Backdoor Attacks on Self-Supervised Learning"
☆10 · Jun 9, 2023 · Updated 3 years ago
xiaopp123 / knowledge_distillation
BERT distillation in practice, including distilling BERT into a BiLSTM, and TinyBERT
☆13 · Apr 23, 2022 · Updated 4 years ago
ZiangYan / pda.pytorch
Implementation of our ICLR 2021 paper: Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples.
☆11 · Mar 9, 2021 · Updated 5 years ago
csfaculty / csfaculty.github.io
Interview questions for Computer Science faculty jobs
☆43 · Mar 13, 2024 · Updated 2 years ago
rrgeorge-pdcontributions / NSFW-Words-List
Text file containing NSFW words aggregated from various sources.
☆12 · Aug 23, 2020 · Updated 5 years ago
LukasStruppek / Exploiting-Cultural-Biases-via-Homoglyphs
[Journal of Artificial Intelligence Research] Source code for our paper "Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synth…
☆12 · Jan 8, 2024 · Updated 2 years ago
meng-wenlong / LMSanitator
☆29 · Aug 21, 2023 · Updated 2 years ago
thuiar / Robust-MSA
☆11 · May 12, 2023 · Updated 3 years ago
QData / TextAttack-A2T
A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)
☆27 · Sep 12, 2021 · Updated 4 years ago