AI-secure / InfoBERT
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu
☆85Updated last year
Alternatives and similar repositories for InfoBERT
Users that are interested in InfoBERT are comparing it to the libraries listed below
Sorting:
- [EMNLP 2020] "T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack" by Boxin Wang, Hengzhi Pei, Boyuan Pan, Q…☆26Updated 3 years ago
- Implementation for Variational Information Bottleneck for Effective Low-resource Fine-tuning, ICLR 2021☆39Updated 4 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆26Updated 3 years ago
- ☆26Updated 4 years ago
- ☆24Updated 4 years ago
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"☆38Updated 3 years ago
- ☆20Updated 3 years ago
- Official code for the paper "PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains".☆51Updated 3 years ago
- OOD Generalization and Detection (ACL 2020)☆60Updated 5 years ago
- Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021☆43Updated 3 years ago
- Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency☆69Updated 2 years ago
- Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)☆38Updated 5 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated 2 years ago
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models☆42Updated 3 years ago
- Language Model Baselines for PyTorch☆42Updated 4 years ago
- ☆75Updated last year
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"☆17Updated 4 years ago
- ☆63Updated 5 years ago
- ☆42Updated 4 years ago
- NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790☆27Updated 2 years ago
- Code for ACL2018 HotFlip: White-Box Adversarial Examples for Text Classification, Word-level Adversarial Examples☆35Updated 6 years ago
- ☆53Updated 2 years ago
- Restore safety in fine-tuned language models through task arithmetic☆28Updated last year
- ☆44Updated last year
- The code for lifelong few-shot language learning☆55Updated 3 years ago
- DEMix Layers for Modular Language Modeling☆53Updated 3 years ago
- Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123☆12Updated 3 years ago
- Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)☆15Updated 4 years ago
- Code for "Imitation Attacks and Defenses for Black-box Machine Translations Systems"☆36Updated 5 years ago
- ☆25Updated 3 years ago