thunlp/NeuBA

☆25

Alternatives and similar repositories for NeuBA

Users who are interested in NeuBA are comparing it to the repositories listed below.

kangjie-chen / BadPre
☆12 · Feb 21, 2022 · Updated 4 years ago
lancopku / RAP
Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)
☆25 · Oct 21, 2021 · Updated 4 years ago
lancopku / Embedding-Poisoning
Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…
☆45 · Jul 26, 2021 · Updated 4 years ago
haowang02 / TransTroj
[WWW '25] Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability
☆18 · May 30, 2025 · Updated 11 months ago
lancopku / SOS
Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)
☆24 · Dec 9, 2021 · Updated 4 years ago
meng-wenlong / LMSanitator
☆29 · Aug 21, 2023 · Updated 2 years ago
joey1993 / bert-defender
Code for the paper "Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification" (EMNLP 2019)
☆15 · Feb 25, 2020 · Updated 6 years ago
PurduePAML / PICCOLO
☆26 · Dec 1, 2022 · Updated 3 years ago
leix28 / prompt-universal-vulnerability
Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" (Findings of NAACL 2022)
☆32 · Jul 11, 2022 · Updated 3 years ago
leileigan / clean_label_textual_backdoor_attack
☆19 · Feb 10, 2022 · Updated 4 years ago
turboLJY / Transfer-Prompts-for-Text-Generation
☆16 · Aug 14, 2022 · Updated 3 years ago
genglinliu / UnknownBench
Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge
☆14 · Feb 20, 2024 · Updated 2 years ago
bboylyg / ABL
Anti-Backdoor learning (NeurIPS 2021)
☆83 · Jul 20, 2023 · Updated 2 years ago
zeyuanyin / LTH-Backdoor
[Preprint] Backdoor Attacks on Federated Learning with Lottery Ticket Hypothesis
☆10 · Sep 23, 2021 · Updated 4 years ago
pouyapez / criage
Investigating Robustness and Interpretability of Link Prediction via Adversarial Modifications
☆19 · Jul 20, 2020 · Updated 5 years ago
Huiying-Li / Latent-Backdoor
This is the documentation of the Tensorflow/Keras implementation of Latent Backdoor Attacks. Please see the paper for details Latent Back…
☆23 · Sep 8, 2021 · Updated 4 years ago
qinliu9 / Flooding-X
☆14 · Jul 13, 2022 · Updated 3 years ago
jianyizhang123 / FLOP
☆10 · Jan 31, 2022 · Updated 4 years ago
lancopku / CascadeBERT
Code for CascadeBERT, Findings of EMNLP 2021
☆12 · Mar 30, 2022 · Updated 4 years ago
uchicago-sandlab / naturalbackdoors
Code for identifying natural backdoors in existing image datasets.
☆15 · Aug 24, 2022 · Updated 3 years ago
jinyuan-jia / BadEncoder
☆84 · Aug 3, 2021 · Updated 4 years ago
yjkim721 / STRIP-ViTA
This work corroborates a run-time Trojan detection method exploiting STRong Intentional Perturbation of inputs, is a multi-domain Trojan …
☆10 · Mar 7, 2021 · Updated 5 years ago
Megum1 / DFST
[AAAI'21] Deep Feature Space Trojan Attack of Neural Networks by Controlled Detoxification
☆30 · Dec 31, 2024 · Updated last year
Hsuan-Tung / universal_attack_natural_trigger
Natural Universal Trigger Search (NUTS)
☆21 · Apr 17, 2021 · Updated 5 years ago
neulab / RIPPLe
Code for the paper "Weight Poisoning Attacks on Pre-trained Models" (ACL 2020)
☆143 · Sep 22, 2025 · Updated 8 months ago
HanxunH / Detect-CLIP-Backdoor-Samples
[ICLR2025] Detecting Backdoor Samples in Contrastive Language Image Pretraining
☆20 · Feb 26, 2025 · Updated last year
thunlp / OpenBackdoor
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
☆209 · Apr 10, 2023 · Updated 3 years ago
wangbo9719 / MEXTRA
Source code for the ACL'2025 paper titled "Unveiling privacy risks in llm agent memory"
☆30 · Dec 2, 2025 · Updated 5 months ago
VITA-Group / Trap-and-Replace-Backdoor-Defense
[NeurIPS'22] Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork. Haotao Wang, Junyuan Hong,…
☆14 · Nov 27, 2023 · Updated 2 years ago
xingyizhao / PURE
Code associated with ICML (2024). "Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normaliz…
☆10 · Feb 22, 2026 · Updated 3 months ago
HXZhong1997 / FSBA
☆11 · Jan 25, 2022 · Updated 4 years ago
google-research-datasets / adversarial-nibbler
This dataset contains results from all rounds of Adversarial Nibbler. This data includes adversarial prompts fed into public generative t…
☆27 · Feb 3, 2025 · Updated last year
SJTUHaiyangYu / BackdoorMBTI
BackdoorMBTI is an open source project expanding the unimodal backdoor learning to a multimodal context. We hope that BackdoorMBTI can fa…
☆28 · Updated this week
Twilight92z / Quantize-Watermark
☆19 · Nov 6, 2023 · Updated 2 years ago
xinleihe / toxic-prompt
☆27 · Nov 20, 2023 · Updated 2 years ago
qijimrc / ROBUST
☆13 · Oct 19, 2023 · Updated 2 years ago
locuslab / breaking-poisoned-classifier
Code for paper "Poisoned classifiers are not only backdoored, they are fundamentally broken"
☆26 · Jan 7, 2022 · Updated 4 years ago
xiaopp123 / knowledge_distillation
BERT distillation in practice, including distilling BERT into a BiLSTM, and TinyBert
☆13 · Apr 23, 2022 · Updated 4 years ago
YiZeng623 / DeepSweep
An evaluation framework for mitigating DNN backdoor attacks using data augmentations
☆11 · Dec 10, 2020 · Updated 5 years ago