thunlp/ONION

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thunlp/ONION)

thunlp / ONION

Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"

☆39

Alternatives and similar repositories for ONION

Users that are interested in ONION are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thunlp / HiddenKiller
View on GitHub
Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"
☆46Sep 11, 2022Updated 3 years ago
thunlp / StyleAttack
View on GitHub
Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"
☆46Oct 12, 2022Updated 3 years ago
lancopku / Embedding-Poisoning
View on GitHub
Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…
☆44Jul 26, 2021Updated 4 years ago
lancopku / RAP
View on GitHub
Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)
☆25Oct 21, 2021Updated 4 years ago
thunlp / OpenBackdoor
View on GitHub
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
☆209Apr 10, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
thunlp / BkdAtk-LWS
View on GitHub
Code and data of the ACL 2021 paper "Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution"
☆16Jun 29, 2021Updated 5 years ago
lancopku / DAN
View on GitHub
[Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks
☆13Feb 26, 2023Updated 3 years ago
alevine0 / DPA
View on GitHub
Code for the paper "Deep Partition Aggregation: Provable Defenses against General Poisoning Attacks"
☆14Aug 22, 2022Updated 3 years ago
neeharperi / DeepKNNDefense
View on GitHub
KNN Defense Against Clean Label Poisoning Attacks
☆13Sep 24, 2021Updated 4 years ago
UKPLab / emnlp2020-debiasing-unknown
View on GitHub
☆26Apr 15, 2021Updated 5 years ago
ReliableCoding / REPEAT
View on GitHub
☆10Apr 15, 2023Updated 3 years ago
gauss5930 / AlpaGasus2-QLoRA
View on GitHub
This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!
☆15Nov 22, 2023Updated 2 years ago
marcotcr / qa_consistency
View on GitHub
Evaluate QA models for consistency
☆20Nov 21, 2022Updated 3 years ago
VITA-Group / Trap-and-Replace-Backdoor-Defense
View on GitHub
[NeurIPS'22] Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork. Haotao Wang, Junyuan Hong,…
☆15Nov 27, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
alvinchangw / CARA_EMNLP2020
View on GitHub
Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)
☆15Oct 8, 2020Updated 5 years ago
RU-System-Software-and-Security / FeatureRE
View on GitHub
☆27Nov 9, 2022Updated 3 years ago
INK-USC / rockner
View on GitHub
☆11Oct 3, 2021Updated 4 years ago
NTDXYG / COTTON
View on GitHub
Data and code for "Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models", which accepted in TSE.
☆15Jul 3, 2024Updated 2 years ago
TDteach / Demon-in-the-Variant
View on GitHub
☆13Oct 21, 2021Updated 4 years ago
ds4an / CoDas4CG
View on GitHub
Contests based Dataset for Code Generation
☆13Dec 11, 2022Updated 3 years ago
BuiltOntheRock / FSE22_BuiltOntheRock
View on GitHub
☆26Jul 19, 2022Updated 4 years ago
thunlp / LLM-generated-text-detection
View on GitHub
☆13Nov 7, 2023Updated 2 years ago
WhitolfChen / SCAR
View on GitHub
[NeurIPS 2025] Taught Well Learned Ill: Towards Distillation-conditional Backdoor Attack
☆15Nov 19, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
DeepSoftwareAnalytics / Telly
View on GitHub
Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
☆23Apr 9, 2023Updated 3 years ago
SRI-CSL / TrinityMultimodalTrojAI
View on GitHub
☆35Jun 27, 2022Updated 4 years ago
reds-lab / ASSET
View on GitHub
This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…
☆20Jun 7, 2023Updated 3 years ago
nuaa-nlp / TrustworthyAIPapers
View on GitHub
List of Papers on Attack and Defense (AD) in AI Models
☆29Mar 18, 2022Updated 4 years ago
miyyer / scpn
View on GitHub
syntactically controlled paraphrase networks
☆168Dec 30, 2018Updated 7 years ago
penghui-yang / awesome-data-poisoning-and-backdoor-attacks
View on GitHub
A curated list of papers & resources linked to data poisoning, backdoor attacks and defenses against them (no longer maintained)
☆292Jan 11, 2025Updated last year
DunZhang / DomainSpecificThesaurus
View on GitHub
☆15Jan 19, 2020Updated 6 years ago
Yangyi-Chen / PaperList-Trustworthy-Applications
View on GitHub
Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c…
☆21May 30, 2023Updated 3 years ago
isi-nlp / ai2
View on GitHub
Framework for testing models with AI2 leaderboards
☆21Nov 8, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wyu-du / GP-VAE
View on GitHub
This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Pr…
☆26Jun 27, 2022Updated 4 years ago
wanlunsec / Beatrix
View on GitHub
☆28Feb 1, 2023Updated 3 years ago
thunlp / SDLM-pytorch
View on GitHub
Code accompanying EMNLP 2018 paper Language Modeling with Sparse Product of Sememe Experts
☆25Dec 31, 2018Updated 7 years ago
ZrW00 / GraCeFul
View on GitHub
The code implementation of GraCeFul (Accepted in COLING 2025)
☆13Jan 27, 2025Updated last year
utahnlp / consistency
View on GitHub
Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models
☆29Jun 13, 2021Updated 5 years ago
leix28 / prompt-universal-vulnerability
View on GitHub
Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022
☆32Jul 11, 2022Updated 4 years ago
thunlp / NeuBA
View on GitHub
☆25Jun 23, 2021Updated 5 years ago