SAFER: A Structure-free Approach For cErtified Robustness to Adversarial Word Substitutions (ACL 2020)
☆31Jan 27, 2021Updated 5 years ago
Alternatives and similar repositories for Structure-free-certified-NLP
Users that are interested in Structure-free-certified-NLP are comparing it to the libraries listed below
Sorting:
- Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)☆38Dec 30, 2019Updated 6 years ago
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆17Oct 8, 2024Updated last year
- ☆14May 7, 2024Updated last year
- codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"☆31Oct 27, 2023Updated 2 years ago
- ☆25May 6, 2021Updated 4 years ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆18Jun 12, 2023Updated 2 years ago
- ☆10Jun 23, 2018Updated 7 years ago
- "Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers" (NeurIPS 2019, previously called "A Stratified Approach …☆17Nov 16, 2019Updated 6 years ago
- This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022☆11Dec 6, 2022Updated 3 years ago
- This repo keeps track of popular provable training and verification approaches towards robust neural networks, including leaderboards on …☆98Oct 18, 2022Updated 3 years ago
- [EMNLP'22] Textual Manifold-based Defense Against Natural Language Adversarial Examples☆11Apr 6, 2023Updated 2 years ago
- Imitation Adversarial Attacks for Black-box Neural Ranking Models☆12Feb 5, 2024Updated 2 years ago
- Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency☆73Mar 24, 2023Updated 2 years ago
- ☆11Mar 6, 2022Updated 4 years ago
- Implementation code for the paper "Generating Natural Language Adversarial Examples"☆171Aug 30, 2019Updated 6 years ago
- codes for paper "learning to discriminate perturbations for blocking adversarial attacks in text classification" in EMNLP19☆15Feb 25, 2020Updated 6 years ago
- ☆14Jul 13, 2022Updated 3 years ago
- Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization"☆88Apr 11, 2021Updated 4 years ago
- A Query Efficient Natural Language Attack in a Black Box Setting☆17Sep 23, 2021Updated 4 years ago
- Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning".☆13Apr 18, 2022Updated 3 years ago
- CROWN: A Neural Network Verification Framework for Networks with General Activation Functions☆39Dec 13, 2018Updated 7 years ago
- An Open-Source Package for Textual Adversarial Attack.☆771Jul 20, 2023Updated 2 years ago
- Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)☆301Jul 25, 2024Updated last year
- Adversarial examples for Seq2Seq model in NLP☆40Nov 3, 2018Updated 7 years ago
- StoneSkipping model for detecting Chinese camouflaged spam☆20May 8, 2020Updated 5 years ago
- Natural Universal Trigger Search (NUTS)☆21Apr 17, 2021Updated 4 years ago
- [NAACL 2022] "SemAttack: Natural Textual Attacks via Different Semantic Spaces" by Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li☆21Jun 11, 2022Updated 3 years ago
- ☆25Jun 23, 2021Updated 4 years ago
- Must-read Papers on Textual Adversarial Attack and Defense☆1,576Jun 4, 2025Updated 9 months ago
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding.☆25Dec 3, 2020Updated 5 years ago
- [EMNLP 2020] "T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack" by Boxin Wang, Hengzhi Pei, Boyuan Pan, Q…☆26Oct 19, 2021Updated 4 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆32Jul 11, 2022Updated 3 years ago
- ☆64Apr 11, 2022Updated 3 years ago
- ☆31Aug 28, 2023Updated 2 years ago
- Convex Layerwise Adversarial Training (COLT)☆29Jan 13, 2021Updated 5 years ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆3,369Jul 10, 2025Updated 7 months ago
- finds a different set of words that sound like the input☆10Feb 24, 2022Updated 4 years ago
- A Model for Natural Language Attack on Text Classification and Inference☆528Dec 8, 2022Updated 3 years ago
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents☆16Sep 16, 2025Updated 5 months ago