QData / TextAttack-A2T
A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)
☆26Updated 3 years ago
Alternatives and similar repositories for TextAttack-A2T:
Users that are interested in TextAttack-A2T are comparing it to the libraries listed below
- ☆24Updated 3 years ago
- Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency☆69Updated last year
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆32Updated 3 years ago
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models☆41Updated 2 years ago
- code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification☆27Updated 2 years ago
- [ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Y…☆84Updated last year
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆29Updated 2 years ago
- Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021☆43Updated 3 years ago
- [EMNLP 2020] "T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack" by Boxin Wang, Hengzhi Pei, Boyuan Pan, Q…☆26Updated 3 years ago
- Natural Universal Trigger Search (NUTS)☆21Updated 3 years ago
- EMNLP BlackBox NLP 2020: Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples☆23Updated 4 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆19Updated last year
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆15Updated 4 months ago
- On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification☆30Updated 2 years ago
- ☆44Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- ☆24Updated 3 years ago
- ☆20Updated 3 years ago
- NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790☆27Updated 2 years ago
- Restore safety in fine-tuned language models through task arithmetic☆27Updated 10 months ago
- Code for ACL2018 HotFlip: White-Box Adversarial Examples for Text Classification, Word-level Adversarial Examples☆34Updated 5 years ago
- Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…☆39Updated 3 years ago
- Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)☆24Updated 3 years ago
- Code for "Universal Adversarial Triggers Are Not Universal."☆16Updated 9 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆57Updated last year
- Official Code for the papers: "Controlled Text Generation as Continuous Optimization with Multiple Constraints" and "Gradient-based Const…☆58Updated 11 months ago
- ☆26Updated 3 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated last year
- Text generation using language models with multiple exit heads☆15Updated 2 weeks ago
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"☆17Updated 4 years ago