zhao-ht / ConvexCertify

Code for our work CISS, "Certified Robustness Against Natural Language Attacks by Causal Intervention", published at ICML 2022

☆11

Alternatives and similar repositories for ConvexCertify

Users who are interested in ConvexCertify compare it to the repositories listed below

zleizzo / datadeletion
☆14Updated 5 years ago
alvinchangw / CARA_EMNLP2020
Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)
☆15Updated 4 years ago
leix28 / prompt-universal-vulnerability
Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" in Findings of NAACL 2022
☆30Updated 3 years ago
eth-sri / smoothing-ensembles
[ICLR 2022] Boosting Randomized Smoothing with Variance Reduced Classifiers
☆12Updated 3 years ago
HazyResearch / correct-n-contrast
Official code repository for Correct-N-Contrast
☆22Updated 3 years ago
violet-zct / group-conditional-DRO
Group-conditional DRO to alleviate spurious correlations
☆15Updated 4 years ago
p-lambda / in-n-out
Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"
☆13Updated 3 years ago
thunlp / NeuBA
☆25Updated 4 years ago
srzer / MOD
Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
☆25Updated 9 months ago
JHL-HUST / FGPM
Adversarial Training with Fast Gradient Projection Method against Synonym Substitution based Text Attacks
☆24Updated 4 years ago
RockyLzy / TextDefender
Code for "Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution"
☆31Updated last year
zaixizhang / CBD
Official implementation of the CVPR 2023 paper "Backdoor Defense via Deconfounded Representation Learning"
☆26Updated 2 years ago
ecreager / eiil
Code for Environment Inference for Invariant Learning (ICML 2021 Paper)
☆50Updated 4 years ago
zjiehang / RanMASK
For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]
☆16Updated 10 months ago
UCSB-NLP-Chang / SelfDenoise
☆14Updated last year
ybjiaang / ACTIR
Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"
☆16Updated 3 years ago
dugu9sword / dne
ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble
☆18Updated 2 years ago
sophie-xhonneux / Continuous-AdvTrain
☆27Updated last year
mireshghallah / neighborhood-curvature-mia
☆21Updated last year
Vaidehi99 / InfoDeletionAttacks
☆44Updated 6 months ago
dongxinshuai / ASCC
☆24Updated 4 years ago
yangarbiter / rare-spurious-correlation
Understanding Rare Spurious Correlations in Neural Networks
☆12Updated 3 years ago
tatsu-lab / linguistic_calibration
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆27Updated last year
locuslab / acr-memorization
☆35Updated 7 months ago
shizhouxing / Robustness-Verification-for-Transformers
[ICLR 2020] Code for paper "Robustness Verification for Transformers"
☆27Updated 8 months ago
successar / instance_attributions_NLP
☆17Updated 4 years ago
UCSB-NLP-Chang / llm_uncertainty
☆32Updated last year
launchnlp / BOLT
Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".
☆20Updated last year
code-terminator / invariant_rationalization
TensorFlow implementation of Invariant Rationalization
☆49Updated 2 years ago
skywalker023 / confaide
🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…
☆43Updated last year