zhao-ht / ConvexCertifyLinks
This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022
☆11Updated 2 years ago
Alternatives and similar repositories for ConvexCertify
Users that are interested in ConvexCertify are comparing it to the libraries listed below
Sorting:
- ☆14Updated 5 years ago
- Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)☆15Updated 4 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆30Updated 3 years ago
- [ICLR 2022] Boosting Randomized Smoothing with Variance Reduced Classifiers☆12Updated 3 years ago
- Official code repository for Correct-N-Contrast☆22Updated 3 years ago
- Group-conditional DRO to alleviate spurious correlations☆15Updated 4 years ago
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆13Updated 3 years ago
- ☆25Updated 4 years ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 9 months ago
- Adversarial Training with Fast Gradient Projection Method against Synonym Substitution based Text Attacks☆24Updated 4 years ago
- codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"☆31Updated last year
- Official Inplementation of CVPR23 paper "Backdoor Defense via Deconfounded Representation Learning"☆26Updated 2 years ago
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆50Updated 4 years ago
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆16Updated 10 months ago
- ☆14Updated last year
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"☆16Updated 3 years ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆18Updated 2 years ago
- ☆27Updated last year
- ☆21Updated last year
- ☆44Updated 6 months ago
- ☆24Updated 4 years ago
- Understanding Rare Spurious Correlations in Neural Network☆12Updated 3 years ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆27Updated last year
- ☆35Updated 7 months ago
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆27Updated 8 months ago
- ☆17Updated 4 years ago
- ☆32Updated last year
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆20Updated last year
- Tensorflow implementation of Invariant Rationalization☆49Updated 2 years ago
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…☆43Updated last year