codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"
☆31Oct 27, 2023Updated 2 years ago
Alternatives and similar repositories for TextDefender
Users that are interested in TextDefender are comparing it to the libraries listed below
Sorting:
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆18Jun 12, 2023Updated 2 years ago
- ☆25May 6, 2021Updated 4 years ago
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆17Oct 8, 2024Updated last year
- [EMNLP'22] Textual Manifold-based Defense Against Natural Language Adversarial Examples☆11Apr 6, 2023Updated 2 years ago
- SAFER: A Structure-free Approach For cErtified Robustness to Adversarial Word Substitutions (ACL 2020)☆31Jan 27, 2021Updated 5 years ago
- Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)☆38Dec 30, 2019Updated 6 years ago
- ☆10Oct 28, 2020Updated 5 years ago
- codes for paper "learning to discriminate perturbations for blocking adversarial attacks in text classification" in EMNLP19☆15Feb 25, 2020Updated 6 years ago
- [EMNLP 2020] "T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack" by Boxin Wang, Hengzhi Pei, Boyuan Pan, Q…☆26Oct 19, 2021Updated 4 years ago
- Implementation of "Effective Adversarial Regularization for Neural Machine Translation", ACL 2019☆21Jan 11, 2020Updated 6 years ago
- Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization"☆88Apr 11, 2021Updated 4 years ago
- A tool for deploying many tasks automatically.☆11Jan 16, 2025Updated last year
- ☆31Aug 28, 2023Updated 2 years ago
- Adversarial examples for Seq2Seq model in NLP☆40Nov 3, 2018Updated 7 years ago
- Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)☆21Sep 27, 2022Updated 3 years ago
- Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency☆76Mar 24, 2023Updated 2 years ago
- ☆11Mar 6, 2022Updated 4 years ago
- my commonly-used tools☆64Jan 7, 2025Updated last year
- Natural Universal Trigger Search (NUTS)☆21Apr 17, 2021Updated 4 years ago
- From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks☆14Feb 23, 2023Updated 3 years ago
- ☆21Jun 3, 2021Updated 4 years ago
- Code for the paper "MMA Training: Direct Input Space Margin Maximization through Adversarial Training"☆34Apr 1, 2020Updated 5 years ago
- Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning".☆13Apr 18, 2022Updated 3 years ago
- Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"☆16Dec 4, 2024Updated last year
- Coupling rejection strategy against adversarial attacks (CVPR 2022)☆29Mar 2, 2022Updated 4 years ago
- Crafting Adversarial Examples for Neural Machine Translation☆10Apr 7, 2023Updated 2 years ago
- [NAACL 2022] "SemAttack: Natural Textual Attacks via Different Semantic Spaces" by Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li☆21Jun 11, 2022Updated 3 years ago
- A framework for adversarial attacks against token classification models☆33Nov 6, 2021Updated 4 years ago
- A machine learning algorithm library in pure Python with mini project included for every algorithm.☆11Oct 23, 2017Updated 8 years ago
- Code for SPINE - Sparse Interpretable Neural Embeddings. Jhamtani H.*, Pruthi D.*, Subramanian A.*, Berg-Kirkpatrick T., Hovy E. AAAI 20…☆14Jan 25, 2020Updated 6 years ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆3,379Jul 10, 2025Updated 8 months ago
- A Model for Natural Language Attack on Text Classification and Inference☆530Dec 8, 2022Updated 3 years ago
- ☆10Jun 23, 2018Updated 7 years ago
- EMNLP BlackBox NLP 2020: Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples☆26Oct 11, 2020Updated 5 years ago
- Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"☆110Dec 28, 2022Updated 3 years ago
- Adversarial Training for Natural Language Understanding☆253Sep 6, 2023Updated 2 years ago
- [ICLR 2024] Provable Robust Watermarking for AI-Generated Text☆38Dec 4, 2023Updated 2 years ago
- Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)☆301Jul 25, 2024Updated last year
- Code repository of the paper "Alleviating Adversarial Attacks on Variational Autoencoders with MCMC" published at NeurIPS 2022. https://a…☆10Dec 14, 2022Updated 3 years ago