facebookresearch / radioactive-watermark
Code for the paper "Watermarking Makes Language Models Radioactive"
☆14Updated 3 months ago
Alternatives and similar repositories for radioactive-watermark:
Users that are interested in radioactive-watermark are comparing it to the libraries listed below
- Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs"☆45Updated 11 months ago
- The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns…☆68Updated 2 months ago
- ☆16Updated 10 months ago
- Code for our paper "Benchmarking the Robustness of Image Watermarks"☆61Updated 4 months ago
- ☆17Updated 9 months ago
- Code for the paper "Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks"☆27Updated 7 months ago
- List of T2I safety papers, updated daily, welcome to discuss using Discussions☆55Updated 5 months ago
- ☆22Updated 8 months ago
- ☆91Updated 11 months ago
- Official repository for "On the Multi-modal Vulnerability of Diffusion Models"☆14Updated 6 months ago
- Code for the paper "Autoregressive Perturbations for Data Poisoning" (NeurIPS 2022)☆18Updated 4 months ago
- Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Model…☆36Updated 2 months ago
- Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.☆105Updated last year
- Code for our S&P'21 paper: Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding☆52Updated 2 years ago
- [NeurIPS 2024 D&B Track] UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models by Yihua Zhang, Cho…☆61Updated 2 months ago
- ☆13Updated 7 months ago
- This code is the official implementation of WEvade.☆38Updated 10 months ago
- Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"☆37Updated 2 weeks ago
- ☆24Updated 2 weeks ago
- The code for the paper titled as "DifAttack: Query-Efficient Black-Box Attack via Disentangled Feature Space".☆15Updated 7 months ago
- Official repo to reproduce the paper "How to Backdoor Diffusion Models?" published at CVPR 2023☆85Updated 4 months ago
- [ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images☆25Updated last year
- [NeurIPS'2023] Official Code Repo:Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability☆94Updated last year
- ☆57Updated 2 years ago
- A curated list of watermarking schemes for generative AI models☆73Updated 6 months ago
- ☆26Updated 7 months ago
- [CCS'22] SSLGuard: A Watermarking Scheme for Self-supervised Learning Pre-trained Encoders☆18Updated 2 years ago
- [CVPR 2024] official code for SimAC☆17Updated last week
- WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models (CVPR 2024)☆15Updated 7 months ago
- ☆24Updated 6 months ago