zhao-ht / ConvexCertify
Code for our work CISS, "Certified Robustness Against Natural Language Attacks by Causal Intervention", published at ICML 2022.
☆11Updated 3 years ago
Alternatives and similar repositories for ConvexCertify
Users that are interested in ConvexCertify are comparing it to the libraries listed below
- Group-conditional DRO to alleviate spurious correlations☆15Updated 4 years ago
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"☆16Updated 3 years ago
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆51Updated 4 years ago
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆13Updated 4 years ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆18Updated 2 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆31Updated 3 years ago
- Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)☆15Updated 5 years ago
- ☆14Updated 5 years ago
- Code for "Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution"☆31Updated 2 years ago
- A reproduced PyTorch implementation of the Adversarially Reweighted Learning (ARL) model, originally presented in "Fairness without Demog…☆20Updated 4 years ago
- This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence…☆17Updated 5 years ago
- ☆17Updated 4 years ago
- TensorFlow implementation of Invariant Rationalization☆49Updated 2 years ago
- Adversarial Training with Fast Gradient Projection Method against Synonym Substitution based Text Attacks☆24Updated 5 years ago
- Official code repository for Correct-N-Contrast☆23Updated 3 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆10Updated last year
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆27Updated last year
- [ICLR 2022] Boosting Randomized Smoothing with Variance Reduced Classifiers☆12Updated 3 years ago
- The demo for "Convolutional Poisson Gamma Belief Network" published in ICML2019☆11Updated 3 years ago
- ☆25Updated 4 years ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆28Updated last year
- ☆14Updated last year
- ☆50Updated 2 years ago
- PyTorch code for the NeurIPS 2021 paper: Fairness via Representation Neutralization☆10Updated 4 years ago
- This is the project for IRM methods☆13Updated 4 years ago
- "Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers" (NeurIPS 2019, previously called "A Stratified Approach …☆17Updated 6 years ago
- ☆25Updated 4 years ago
- ☆14Updated 2 years ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆29Updated last year
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆83Updated last year