codes for paper "learning to discriminate perturbations for blocking adversarial attacks in text classification" in EMNLP19
☆15Feb 25, 2020Updated 6 years ago
Alternatives and similar repositories for bert-defender
Users that are interested in bert-defender are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆25May 6, 2021Updated 4 years ago
- Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)☆38Dec 30, 2019Updated 6 years ago
- ☆14Jul 13, 2022Updated 3 years ago
- codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"☆31Oct 27, 2023Updated 2 years ago
- Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization"☆88Apr 11, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆25Jun 23, 2021Updated 4 years ago
- Reproduce multiple adversarial attack methods☆21May 5, 2020Updated 5 years ago
- [ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Y…☆85Oct 25, 2023Updated 2 years ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆17Jun 12, 2023Updated 2 years ago
- Natural Universal Trigger Search (NUTS)☆21Apr 17, 2021Updated 5 years ago
- ☆64Apr 11, 2022Updated 4 years ago
- ☆12Jul 13, 2022Updated 3 years ago
- ☆10May 18, 2022Updated 3 years ago
- Adversarial Training with Fast Gradient Projection Method against Synonym Substitution based Text Attacks☆24Dec 11, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for "Counterfactual Variable Control for Robust and Interpretable Question Answering"☆14Oct 13, 2020Updated 5 years ago
- ☆27Nov 20, 2023Updated 2 years ago
- my commonly-used tools☆64Jan 7, 2025Updated last year
- CROWN: A Neural Network Verification Framework for Networks with General Activation Functions☆39Dec 13, 2018Updated 7 years ago
- Code for "Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models (CoNLL 2018)"☆15Feb 6, 2019Updated 7 years ago
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns☆14Mar 1, 2025Updated last year
- Natural Language Attacks in a Hard Label Black Box Setting.☆50May 26, 2021Updated 4 years ago
- Implementation Code of TextHoaxer☆15Aug 21, 2022Updated 3 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?☆16Aug 8, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆15Dec 17, 2025Updated 4 months ago
- ☆14Mar 5, 2026Updated last month
- Paper list of Adversarial Examples☆51Oct 12, 2023Updated 2 years ago
- The official code to reproduce results from the NACCL2019 paper: White-to-Black: Efficient Distillation of Black-Box Adversarial Attacks☆12Jun 4, 2019Updated 6 years ago
- A PyTorch Implementation of the EMNLP 2020 paper "Mitigating Gender Bias for Neural Dialogue Generation with Adversarial Learning"☆13Feb 20, 2021Updated 5 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆32Jul 11, 2022Updated 3 years ago
- ☆14Apr 11, 2024Updated 2 years ago
- Efficient Neural Network Loss Landscape Generation☆10Apr 26, 2019Updated 6 years ago
- ☆57Oct 9, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Developed with the UROP, Detecting Deep Learning Software Defects (Spring 2019), HKUST☆25Dec 24, 2020Updated 5 years ago
- CoQA QuesAns using BERT, PyTorch and Hugging Face Transformers☆19Jun 9, 2021Updated 4 years ago
- ☆14Mar 10, 2020Updated 6 years ago
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆26Nov 26, 2024Updated last year
- This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022☆11Dec 6, 2022Updated 3 years ago
- ☆15Jul 26, 2024Updated last year
- Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models☆34Apr 18, 2023Updated 2 years ago