hlzhang109 / impossibility-watermarkLinks
[ICML 2024] Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models
☆24Updated last year
Alternatives and similar repositories for impossibility-watermark
Users that are interested in impossibility-watermark are comparing it to the libraries listed below
Sorting:
- Code for watermarking language models☆84Updated last year
- Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs"☆50Updated last year
- ☆32Updated last year
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆12Updated 2 years ago
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.☆35Updated last year
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.☆90Updated last year
- [ICLR 2024] Provable Robust Watermarking for AI-Generated Text☆37Updated last year
- Starter kit and data loading code for the Trojan Detection Challenge NeurIPS 2022 competition☆33Updated 2 years ago
- ☆27Updated 9 months ago
- ☆53Updated 2 years ago
- [ICML 2023] Are Diffusion Models Vulnerable to Membership Inference Attacks?☆42Updated last year
- Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024☆34Updated last year
- UP-TO-DATE LLM Watermark paper. 🔥🔥🔥☆365Updated 11 months ago
- Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"☆62Updated last year
- ☆32Updated 2 years ago
- ☆59Updated 2 years ago
- ☆33Updated 6 months ago
- ☆46Updated last year
- [NeurIPS23 (Spotlight)] "Model Sparsity Can Simplify Machine Unlearning" by Jinghan Jia*, Jiancheng Liu*, Parikshit Ram, Yuguang Yao, Gao…☆81Updated last year
- Official Repository for Dataset Inference for LLMs☆43Updated last year
- Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models"☆30Updated 2 years ago
- This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.☆63Updated 11 months ago
- Code for our S&P'21 paper: Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding☆53Updated 3 years ago
- ☆23Updated last year
- Repository for Towards Codable Watermarking for Large Language Models☆38Updated 2 years ago
- [ICML2024] Adaptive Text Watermark for Large Language Models☆23Updated 11 months ago
- Official repo to reproduce the paper "How to Backdoor Diffusion Models?" published at CVPR 2023☆94Updated 2 months ago
- ☆40Updated last year
- [ICLR24 (Spotlight)] "SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation…☆139Updated 6 months ago
- ☆70Updated 9 months ago