BrianPulfer / LMWatermark
Implementation of the paper "A Watermark for Large Language Models" by Kirchenbauer, Geiping et al.
☆24 · Updated 2 years ago
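For context, the watermark the repository implements partitions the vocabulary into a "green" and "red" list seeded by the previous token, adds a bias δ to green-list logits before sampling, and detects the mark with a one-sided z-test on the green-token count. Below is a minimal pure-Python sketch of that idea, not the repository's actual code: the function names, the SHA-256 seeding, and the `random.Random` shuffle partition are illustrative assumptions.

```python
import hashlib
import math
import random


def green_list(prev_token: int, vocab_size: int, gamma: float = 0.5) -> set:
    # Seed a PRNG with a hash of the previous token id and take the first
    # gamma * vocab_size ids of a shuffled vocabulary as the "green" list.
    # (Illustrative seeding; the paper seeds on the preceding token.)
    seed = int.from_bytes(hashlib.sha256(str(prev_token).encode()).digest()[:8], "big")
    rng = random.Random(seed)
    ids = list(range(vocab_size))
    rng.shuffle(ids)
    return set(ids[: int(gamma * vocab_size)])


def bias_logits(logits: list, prev_token: int, gamma: float = 0.5, delta: float = 2.0) -> list:
    # "Soft" watermark: add delta to every green-list logit before sampling.
    greens = green_list(prev_token, len(logits), gamma)
    return [l + delta if i in greens else l for i, l in enumerate(logits)]


def detect(tokens: list, vocab_size: int, gamma: float = 0.5) -> float:
    # z-score of observed green hits against the gamma * T count expected
    # for unwatermarked text; large positive z indicates a watermark.
    hits = sum(
        1
        for prev, tok in zip(tokens, tokens[1:])
        if tok in green_list(prev, vocab_size, gamma)
    )
    t = len(tokens) - 1
    return (hits - gamma * t) / math.sqrt(t * gamma * (1 - gamma))
```

Text generated while always sampling from the green list scores a high z (strong watermark evidence), while text drawn from the red list scores negative; real generation sits in between depending on δ and the entropy of the text.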
Alternatives and similar repositories for LMWatermark
Users interested in LMWatermark are comparing it to the repositories listed below.
- Code for watermarking language models ☆82 · Updated last year
- Official implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs" ☆48 · Updated last year
- Source code of the paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" (ICLR 2024) ☆35 · Updated last year
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP ☆13 · Updated 2 years ago
- Official repository of the paper "On the Exploitability of Instruction Tuning" ☆64 · Updated last year
- ☆43 · Updated 2 years ago
- ☆39 · Updated 2 years ago
- Code and data for the paper "A Semantic Invariant Robust Watermark for Large Language Models" (ICLR 2024) ☆34 · Updated 10 months ago
- [ICCV 2023] Source code for the paper "Rickrolling the Artist: Injecting Invisible Backdoors into Text-Guided Image Generation Models" ☆63 · Updated last year
- ☆45 · Updated 7 months ago
- ☆149 · Updated last year
- [NeurIPS'24] LLM Safety Landscape ☆29 · Updated 7 months ago
- ☆14 · Updated last year
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs ☆19 · Updated 6 months ago
- Official repository for Dataset Inference for LLMs ☆41 · Updated last year
- Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks ☆31 · Updated last year
- ☆39 · Updated last year
- [ICML 2024] Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models ☆24 · Updated last year
- Watermarking Text Generated by Black-Box Language Models ☆39 · Updated last year
- [TACL] Code for "Red Teaming Language Model Detectors with Language Models" ☆23 · Updated last year
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning ☆97 · Updated last year
- Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts" ☆39 · Updated last year
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer ☆46 · Updated last year
- Code for the paper "Universal Jailbreak Backdoors from Poisoned Human Feedback" ☆59 · Updated last year
- Code of the paper "A Recipe for Watermarking Diffusion Models" ☆151 · Updated 10 months ago
- ☆23 · Updated 9 months ago
- ☆20 · Updated last year
- ☆22 · Updated 2 years ago
- Official repository for "PostMark: A Robust Blackbox Watermark for Large Language Models" ☆27 · Updated last year
- Repo for the arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers" ☆109 · Updated 2 years ago