joey1993 / bert-defenderLinks

codes for paper "learning to discriminate perturbations for blocking adversarial attacks in text classification" in EMNLP19

☆15

Alternatives and similar repositories for bert-defender

Users that are interested in bert-defender are comparing it to the libraries listed below

Sorting:

lushleaf / Structure-free-certified-NLP
SAFER: A Structure-free Approach For cErtified Robustness to Adversarial Word Substitutions (ACL 2020)
☆31Updated 4 years ago
JHL-HUST / PWWS
Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency
☆71Updated 2 years ago
robinjia / certified-word-sub
Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)
☆38Updated 5 years ago
thunlp / SememePSO-Attack
Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization"
☆87Updated 4 years ago
anyirao / WordAdver
Code for ACL2018 HotFlip: White-Box Adversarial Examples for Text Classification, Word-level Adversarial Examples
☆36Updated 6 years ago
cmhcbb / Seq2Sick
Adversarial examples for Seq2Seq model in NLP
☆40Updated 6 years ago
csong27 / collision-bert
☆26Updated 4 years ago
JHL-HUST / FGPM
Adversarial Training with Fast Gradient Projection Method against Synonym Substitution based Text Attacks
☆24Updated 4 years ago
LinyangLee / BERT-Attack
Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT
☆199Updated 4 years ago
nesl / nlp_adversarial_examples
Implementation code for the paper "Generating Natural Language Adversarial Examples"
☆170Updated 5 years ago
cookielee77 / CLARE
Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021
☆43Updated 4 years ago
neulab / RIPPLe
Code for the paper "Weight Poisoning Attacks on Pre-trained Models" (ACL 2020)
☆142Updated 3 years ago
dugu9sword / dne
ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble
☆18Updated 2 years ago
RishabhMaheshwary / query-attack
A Query Efficient Natural Language Attack in a Black Box Setting
☆16Updated 3 years ago
orgoro / white-2-black
The official code to reproduce results from the NACCL2019 paper: White-to-Black: Efficient Distillation of Black-Box Adversarial Attacks
☆12Updated 6 years ago
Hsuan-Tung / universal_attack_natural_trigger
Natural Universal Trigger Search (NUTS)
☆21Updated 4 years ago
thunlp / NeuBA
☆25Updated 4 years ago
thunlp / StyleAttack
Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"
☆43Updated 2 years ago
AI-secure / T3
[EMNLP 2020] "T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack" by Boxin Wang, Hengzhi Pei, Boyuan Pan, Q…
☆26Updated 3 years ago
cecilialeiqi / adversarial_text
to add
☆20Updated 5 years ago
zjiehang / RanMASK
For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]
☆16Updated 9 months ago
yanaiela / demog-text-removal
☆51Updated 6 years ago
LilyNLP / ADFAR
☆9Updated 4 years ago
Eric-Wallace / universal-triggers
Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)
☆295Updated last year
jebivid / adversarial-nmt
☆10Updated 7 years ago
alvinchangw / CARA_EMNLP2020
Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)
☆15Updated 4 years ago
thunlp / ONION
Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"
☆34Updated 3 years ago
RockyLzy / TextDefender
codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"
☆31Updated last year
danishpruthi / Adversarial-Misspellings
☆64Updated 3 years ago
AI-secure / InfoBERT
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Y…
☆85Updated last year