yepengliu / adaptive-text-watermarkLinks

[ICML2024] Adaptive Text Watermark for Large Language Models

☆23

Alternatives and similar repositories for adaptive-text-watermark

Users that are interested in adaptive-text-watermark are comparing it to the libraries listed below

Sorting:

THU-BPM / unforgeable_watermark
Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024
☆34Updated last year
hzy312 / Awesome-LLM-Watermark
UP-TO-DATE LLM Watermark paper. 🔥🔥🔥
☆358Updated 10 months ago
hongcheki / sweet-watermark
Official repository of the paper: Who Wrote this Code? Watermarking for Code Generation (ACL 2024)
☆38Updated last year
THU-BPM / Robust_Watermark
Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.
☆34Updated 11 months ago
xiaoniu-578fa6bff964d005 / UnbiasedWatermark
☆39Updated last year
xiaojunxu / learning-to-watermark-llm
☆20Updated last year
chenchenygu / watermark-learnability
☆26Updated 8 months ago
ruisizhang123 / REMARK-LLM
[USENIX Security'24] REMARK-LLM: A robust and efficient watermarking framework for generative large language models
☆25Updated 11 months ago
jthickstun / watermark
Code for watermarking language models
☆82Updated last year
facebookresearch / three_bricks
Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs"
☆48Updated last year
THU-BPM / Watermark-Radioactivity-Attack
Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?". (ACL 2025 Main)
☆17Updated 4 months ago
git-disl / awesome_LLM-harmful-fine-tuning-papers
A survey on harmful fine-tuning attack for large language model
☆216Updated last week
isXinLiu / MM-SafetyBench
Accepted by ECCV 2024
☆160Updated last year
abehou / SemStamp
Repo for SemStamp (NAACL2024) and k-SemStamp (ACL2024)
☆23Updated 10 months ago
isXinLiu / Awesome-MLLM-Safety
Accepted by IJCAI-24 Survey Track
☆218Updated last year
mengtong0110 / InferDPT
☆32Updated 6 months ago
thu-ml / MMTrustEval
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
☆167Updated 3 months ago
OPTML-Group / Diffusion-MU-Attack
The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns…
☆83Updated 7 months ago
XuandongZhao / Unigram-Watermark
[ICLR 2024] Provable Robust Watermarking for AI-Generated Text
☆35Updated last year
bangawayoo / nlp-watermarking
Robust natural language watermarking using invariant features
☆26Updated 2 years ago
THU-BPM / Watermarked_LLM_Identification
Code and data for paper "Can Watermarked LLMs be Identified by Users via Crafted Prompts?" Accepted by ICLR 2025 (Spotlight)
☆25Updated 9 months ago
MartinPawelczyk / In-Context-Unlearning
"In-Context Unlearning: Language Models as Few Shot Unlearners". Martin Pawelczyk, Seth Neel* and Himabindu Lakkaraju*; ICML 2024.
☆28Updated 2 years ago
franciscoliu / SKU
Official code implementation of SKU, Accepted by ACL 2024 Findings
☆18Updated 10 months ago
grasses / PoisonPrompt
Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107
☆17Updated last year
SproutNan / AI-Safety_SCAV
This is the code repository for "Uncovering Safety Risks of Large Language Models through Concept Activation Vector"
☆44Updated this week
zhaojunGUO / Awesome-LLM-Watermark
Watermarking LLM papers up-to-date
☆13Updated last year
umd-huang-lab / VLM-Poisoning
Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"
☆55Updated 9 months ago
thu-coai / JailbreakDefense_GoalPriority
[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
☆29Updated last year
lancopku / codable-watermarking-for-llm
Repository for Towards Codable Watermarking for Large Language Models
☆38Updated 2 years ago
thu-ml / STAIR
Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"
☆75Updated 7 months ago