zhao-ht / ConvexCertify
This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ConvexCertify
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆49Updated 3 years ago
- This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence…☆16Updated 4 years ago
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"☆12Updated 2 years ago
- Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)☆15Updated 4 years ago
- ☆26Updated 9 months ago
- Official code repository for Correct-N-Contrast☆20Updated 2 years ago
- A reproduced PyTorch implementation of the Adversarially Reweighted Learning (ARL) model, originally presented in "Fairness without Demog…☆20Updated 3 years ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆15Updated this week
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆15Updated last year
- ☆30Updated 3 years ago
- Tensorflow implementation of Invariant Rationalization☆48Updated last year
- Group-conditional DRO to alleviate spurious correlations☆15Updated 3 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆27Updated 2 years ago
- ☆15Updated 4 years ago
- This is the project for IRM methods☆12Updated 3 years ago
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆15Updated last month
- [ICLR 2022] Understanding and Improving Graph Injection Attack by Promoting Unnoticeability☆37Updated 11 months ago
- Official Inplementation of CVPR23 paper "Backdoor Defense via Deconfounded Representation Learning"☆25Updated last year
- ☆9Updated last year
- ☆27Updated last year
- Official implementation for KDD'22 paper "Learning Fair Representation via Distributional Contrastive Disentanglement"☆22Updated 2 years ago
- SAFER: A Structure-free Approach For cErtified Robustness to Adversarial Word Substitutions (ACL 2020)☆27Updated 3 years ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆63Updated 8 months ago
- Understanding Rare Spurious Correlations in Neural Network☆11Updated 2 years ago
- ☆26Updated 2 weeks ago
- The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".☆12Updated 3 years ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆13Updated 2 weeks ago
- Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆10Updated 4 months ago
- ☆23Updated 3 years ago
- Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021☆43Updated 3 years ago