hlzhang109 / impossibility-watermarkLinks
[ICML 2024] Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models
☆24Updated last year
Alternatives and similar repositories for impossibility-watermark
Users that are interested in impossibility-watermark are comparing it to the libraries listed below
Sorting:
- Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs"☆50Updated last year
- Code for watermarking language models☆82Updated last year
- Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024☆34Updated last year
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.☆34Updated 11 months ago
- ☆32Updated last year
- [ICLR 2024] Provable Robust Watermarking for AI-Generated Text☆36Updated last year
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆12Updated 2 years ago
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.☆89Updated last year
- ☆53Updated 2 years ago
- Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"☆61Updated last year
- This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.☆63Updated 10 months ago
- UP-TO-DATE LLM Watermark paper. 🔥🔥🔥☆361Updated 10 months ago
- Starter kit and data loading code for the Trojan Detection Challenge NeurIPS 2022 competition☆33Updated 2 years ago
- ☆26Updated 8 months ago
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24…☆23Updated last year
- ☆21Updated last year
- Official repository for "PostMark: A Robust Blackbox Watermark for Large Language Models"☆27Updated last year
- ☆33Updated 5 months ago
- [ICML2024] Adaptive Text Watermark for Large Language Models☆23Updated 10 months ago
- ☆37Updated 10 months ago
- ☆171Updated 3 months ago
- Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models"☆28Updated 2 years ago
- ☆39Updated last year
- ☆32Updated last year
- Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"☆109Updated 2 years ago
- ☆44Updated last year
- Code for our S&P'21 paper: Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding☆53Updated 2 years ago
- Robust natural language watermarking using invariant features☆26Updated 2 years ago
- Repository for Towards Codable Watermarking for Large Language Models☆38Updated 2 years ago
- ☆23Updated last year