THU-BPM / MarkLLM
MarkLLM: An Open-Source Toolkit for LLM Watermarking. (EMNLP 2024 Demo)
☆292 · Updated this week
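Among the schemes MarkLLM implements is the KGW "green-list" watermark (Kirchenbauer et al., 2023): the vocabulary is pseudo-randomly split per step using a hash of the previous token, generation is biased toward the "green" half, and detection counts green hits against the chance rate via a z-score. The sketch below is illustrative only — a toy vocabulary and an extreme sampler that picks exclusively green tokens, not MarkLLM's actual API.

```python
# Minimal sketch of the green-list watermark. Assumptions: toy vocabulary
# of integer token ids; the "model" samples only green tokens (a real LM
# instead adds a logit bias delta to green tokens before softmax).
import hashlib
import math
import random

VOCAB_SIZE = 1000
GAMMA = 0.5  # fraction of the vocabulary put on the green list


def green_list(prev_token: int) -> set:
    """Green token ids for this step, seeded by a hash of the previous token."""
    seed = int.from_bytes(hashlib.sha256(str(prev_token).encode()).digest()[:8], "big")
    rng = random.Random(seed)
    return set(rng.sample(range(VOCAB_SIZE), int(GAMMA * VOCAB_SIZE)))


def generate_watermarked(length: int, start: int = 0) -> list:
    # Toy generator: always sample from the current green list.
    rng, tokens, prev = random.Random(0), [], start
    for _ in range(length):
        tok = rng.choice(sorted(green_list(prev)))
        tokens.append(tok)
        prev = tok
    return tokens


def detect_z_score(tokens, start: int = 0) -> float:
    # Count green hits; unwatermarked text hits the green list at rate
    # GAMMA by chance, so standardize against that binomial baseline.
    hits, prev = 0, start
    for tok in tokens:
        hits += tok in green_list(prev)
        prev = tok
    t = len(tokens)
    return (hits - GAMMA * t) / math.sqrt(GAMMA * (1 - GAMMA) * t)


watermarked = generate_watermarked(200)
rng = random.Random(1)
unmarked = [rng.randrange(VOCAB_SIZE) for _ in range(200)]
print(detect_z_score(watermarked))  # strongly positive: every token is green
print(detect_z_score(unmarked))     # near zero
```

Because detection needs only the hash scheme, not the model, anyone holding the key can verify a text — which is also why several repositories below study attacks on and public verifiability of such watermarks.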
Related projects
Alternatives and complementary repositories for MarkLLM
- UP-TO-DATE LLM Watermark paper. 🔥🔥🔥 ☆293 · Updated this week
- 😎 Up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources. ☆133 · Updated last week
- A collection of automated evaluators for assessing jailbreak attempts. ☆75 · Updated 4 months ago
- The official implementation of our ICLR 2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models". ☆245 · Updated 3 weeks ago
- [ACL 2024] SALAD benchmark & MD-Judge ☆106 · Updated last month
- ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings] ☆158 · Updated last month
- [NAACL 2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey ☆76 · Updated 3 months ago
- A survey on harmful fine-tuning attacks for large language models ☆80 · Updated last week
- The latest papers on detection of LLM-generated text and code ☆216 · Updated last week
- We jailbreak GPT-3.5 Turbo's safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20… ☆241 · Updated 8 months ago
- Official GitHub repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024] ☆157 · Updated 4 months ago
- Accepted by IJCAI-24 Survey Track ☆159 · Updated 2 months ago
- Accepted by ECCV 2024 ☆74 · Updated last month
- A curated list of LLM interpretability-related material: tutorials, libraries, surveys, papers, blogs, etc. ☆174 · Updated last month
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024) ☆61 · Updated last month
- Code for watermarking language models ☆72 · Updated 2 months ago
- Official repository for the ACL 2024 paper "SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding" ☆99 · Updated 4 months ago
- Repository for "Towards Codable Watermarking for Large Language Models" ☆29 · Updated last year
- [ACL 2024 Main] Data and code for WaterBench: Towards Holistic Evaluation of LLM Watermarks ☆18 · Updated last year
- S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models ☆42 · Updated 2 weeks ago
- A resource repository for machine unlearning in large language models ☆218 · Updated last week
- Source code of the paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models", accepted at ICLR 2024 ☆28 · Updated 5 months ago
- Official implementation of AdvPrompter (https://arxiv.org/abs/2404.16873) ☆122 · Updated 6 months ago
- LLM Unlearning ☆125 · Updated last year