hlzhang109 / impossibility-watermark
[ICML 2024] Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models
☆24 Updated last year
Alternatives and similar repositories for impossibility-watermark
Users who are interested in impossibility-watermark are comparing it to the repositories listed below.
- Code for watermarking language models ☆84 Updated last year
- Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs" ☆50 Updated last year
- ☆32 Updated last year
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes ☆12 Updated 2 years ago
- [ICLR 2024] Provable Robust Watermarking for AI-Generated Text ☆38 Updated 2 years ago
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition. ☆90 Updated last year
- ☆27 Updated 11 months ago
- Certified robustness "for free" using off-the-shelf diffusion models and classifiers ☆44 Updated 2 years ago
- Starter kit and data loading code for the Trojan Detection Challenge NeurIPS 2022 competition ☆33 Updated 2 years ago
- ☆53 Updated 2 years ago
- ☆23 Updated last year
- This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction. ☆63 Updated last year
- Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024 ☆34 Updated last year
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024. ☆37 Updated last year
- ☆33 Updated 8 months ago
- [ICML 2023] Are Diffusion Models Vulnerable to Membership Inference Attacks? ☆42 Updated last year
- ☆46 Updated last year
- Watermarking Text Generated by Black-Box Language Models ☆39 Updated 2 years ago
- Official repo to reproduce the paper "How to Backdoor Diffusion Models?" published at CVPR 2023 ☆94 Updated 4 months ago
- Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback" ☆66 Updated last year
- Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers" ☆110 Updated 3 years ago
- Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models" ☆62 Updated 2 years ago
- ☆32 Updated 2 years ago
- ☆70 Updated 11 months ago
- [ICCV 2023 Oral] Official implementation of "Robust Evaluation of Diffusion-Based Adversarial Purification" ☆25 Updated 2 years ago
- Code for the paper "Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks" ☆39 Updated last year
- ☆23 Updated 2 years ago
- This code is the official implementation of WEvade. ☆40 Updated last year
- Differentially Private Diffusion Models ☆105 Updated 2 years ago
- This is the official implementation of our paper 'Untargeted Backdoor Watermark: Towards Harmless and Stealthy Dataset Copyright Protecti… ☆58 Updated last year