xiaoniu-578fa6bff964d005 / UnbiasedWatermark
☆40 Updated last year
Alternatives and similar repositories for UnbiasedWatermark
Users who are interested in UnbiasedWatermark are comparing it to the repositories listed below.
- Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models" ☆31 Updated 2 years ago
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024. ☆37 Updated last year
- This is the code repository for "Uncovering Safety Risks of Large Language Models through Concept Activation Vector" ☆47 Updated 3 months ago
- A survey on harmful fine-tuning attack for large language model ☆231 Updated 3 weeks ago
- ☆22 Updated last year