maximilianmozes / fgwsLinks

Frequency-Guided Word Substitutions for Detecting Textual Adversarial Examples (EACL 2021)

☆8

Alternatives and similar repositories for fgws

Users that are interested in fgws are comparing it to the libraries listed below

Sorting:

cookielee77 / CLARE
Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021
☆43Updated 4 years ago
joeljang / knowledge-unlearning
[ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models
☆82Updated 10 months ago
zjiehang / RanMASK
For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]
☆16Updated 10 months ago
dongxinshuai / ASCC
☆24Updated 4 years ago
RockyLzy / TextDefender
codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"
☆31Updated last year
dugu9sword / dne
ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble
☆18Updated 2 years ago
QData / TextAttack-A2T
A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)
☆26Updated 3 years ago
snw2021 / LLM_Unlearning_Papers
☆26Updated last year
Hsuan-Tung / universal_attack_natural_trigger
Natural Universal Trigger Search (NUTS)
☆21Updated 4 years ago
AkariAsai / ATTEMPT
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆102Updated 2 years ago
pliang279 / LM_bias
[ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models
☆61Updated 2 years ago
mireshghallah / mixmatch
Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models
☆44Updated 3 years ago
leix28 / prompt-universal-vulnerability
Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022
☆30Updated 3 years ago
mkshing / Prompt-Tuning
Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"
☆166Updated 3 years ago
SALT-NLP / mic
Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"
☆20Updated 2 years ago
pliang279 / sent_debias
[ACL 2020] Towards Debiasing Sentence Representations
☆66Updated 2 years ago
mireshghallah / ft-memorization
☆13Updated 2 years ago
GXimingLu / Quark
☆75Updated last year
rivercold / BERT-unsupervised-OOD
Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers"
☆30Updated 3 years ago
qcwthu / Lifelong-Fewshot-Language-Learning
The code for lifelong few-shot language learning
☆55Updated 3 years ago
kawine / dataset_difficulty
"Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)
☆87Updated last year
minicheshire / Robust-Prefix-Tuning
code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification
☆27Updated 3 years ago
xiangyue9607 / QVE
Code for the ACL2022 paper "Synthetic Question Value Estimation for Domain Adaptation of Question Answering"
☆17Updated 3 years ago
AI-secure / InfoBERT
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Y…
☆85Updated last year
princeton-nlp / MABEL
EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975
☆38Updated last year
wzhouad / Contra-OOD
Source code for paper "Contrastive Out-of-Distribution Detection for Pretrained Transformers", EMNLP 2021
☆40Updated 3 years ago
UKPLab / emnlp2020-debiasing-unknown
☆26Updated 4 years ago
thunlp / MixADA
☆21Updated 4 years ago
thunlp / ONION
Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"
☆34Updated 3 years ago
SALT-NLP / Efficient_Unlearning
☆38Updated last year