hlzhang109 / impossibility-watermark
[ICML 2024] Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models
☆24 Updated last year
Alternatives and similar repositories for impossibility-watermark
Users who are interested in impossibility-watermark are comparing it to the repositories listed below.
- Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs" ☆50 Updated last year
- Code for watermarking language models ☆84 Updated last year
- ☆32 Updated last year
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes ☆12 Updated 2 years ago
- ☆32 Updated 2 years ago
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition. ☆90 Updated last year
- [ICLR 2024] Provable Robust Watermarking for AI-Generated Text ☆38 Updated 2 years ago
- ☆27 Updated 10 months ago
- Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback" ☆66 Updated last year
- ☆33 Updated 7 months ago
- ☆53 Updated 2 years ago
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024. ☆37 Updated last year
- [ICML 2023] Are Diffusion Models Vulnerable to Membership Inference Attacks? ☆42 Updated last year
- ☆46 Updated last year
- [NeurIPS23 (Spotlight)] "Model Sparsity Can Simplify Machine Unlearning" by Jinghan Jia*, Jiancheng Liu*, Parikshit Ram, Yuguang Yao, Gao… ☆81 Updated last year
- Starter kit and data loading code for the Trojan Detection Challenge NeurIPS 2022 competition ☆33 Updated 2 years ago
- This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction. ☆63 Updated last year
- Certified robustness "for free" using off-the-shelf diffusion models and classifiers ☆44 Updated 2 years ago
- The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns… ☆87 Updated 9 months ago
- Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024 ☆34 Updated last year
- This is an official repository for Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study (ICCV2023… ☆24 Updated 2 years ago
- Official Repository for Dataset Inference for LLMs ☆43 Updated last year
- ☆23 Updated 2 years ago
- Python package for measuring memorization in LLMs. ☆175 Updated 5 months ago
- ☆23 Updated last year
- Official repo to reproduce the paper "How to Backdoor Diffusion Models?" published at CVPR 2023 ☆94 Updated 3 months ago
- [ICLR24 (Spotlight)] "SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation… ☆141 Updated 6 months ago
- ☆24 Updated last year
- [ICML2024] Adaptive Text Watermark for Large Language Models ☆25 Updated last year
- ☆59 Updated 2 years ago