zhao-ht / ConvexCertifyLinks
This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022
☆11Updated 2 years ago
Alternatives and similar repositories for ConvexCertify
Users that are interested in ConvexCertify are comparing it to the libraries listed below
Sorting:
- Group-conditional DRO to alleviate spurious correlations☆15Updated 4 years ago
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆50Updated 4 years ago
- Official code repository for Correct-N-Contrast☆22Updated 3 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆29Updated 3 years ago
- ☆14Updated 5 years ago
- Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)☆15Updated 4 years ago
- ☆25Updated 4 years ago
- [ICLR 2022] Boosting Randomized Smoothing with Variance Reduced Classifiers☆12Updated 3 years ago
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"☆16Updated 3 years ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 8 months ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆26Updated last year
- ☆24Updated 4 years ago
- Code for "Universal Adversarial Triggers Are Not Universal."☆17Updated last year
- ☆35Updated 6 months ago
- ☆44Updated 2 years ago
- ☆14Updated last year
- ☆17Updated 4 years ago
- ☆44Updated 5 months ago
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆13Updated 3 years ago
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆27Updated 7 months ago
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆21Updated last year
- A reproduced PyTorch implementation of the Adversarially Reweighted Learning (ARL) model, originally presented in "Fairness without Demog…☆20Updated 4 years ago
- Post-processing for fair classification☆15Updated 2 weeks ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆79Updated last year
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆38Updated 3 months ago
- Tensorflow implementation of Invariant Rationalization☆49Updated 2 years ago
- ☆21Updated last year
- [ICLR 2022] Understanding and Improving Graph Injection Attack by Promoting Unnoticeability☆38Updated last year
- The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".☆13Updated 3 years ago
- Adversarial Training with Fast Gradient Projection Method against Synonym Substitution based Text Attacks☆24Updated 4 years ago