xiaoniu-578fa6bff964d005 / UnbiasedWatermark
☆27Updated last month
Related projects: ⓘ
- UP-TO-DATE LLM Watermark paper. 🔥🔥🔥☆253Updated 3 months ago
- [ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of LLM Watermarks☆17Updated 10 months ago
- Repository for Towards Codable Watermarking for Large Language Models☆26Updated last year
- 😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.☆73Updated this week
- ☆18Updated 3 months ago
- Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024☆23Updated 3 months ago
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.☆25Updated 3 months ago
- Code for watermarking language models☆69Updated 2 weeks ago
- Accepted by ECCV 2024☆59Updated 2 months ago
- MarkLLM: An Open-Source Toolkit for LLM Watermarking.☆246Updated last month
- Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models"☆17Updated 10 months ago
- A curated list of trustworthy Generative AI papers. Daily updating...☆67Updated 2 weeks ago
- Robust natural language watermarking using invariant features☆25Updated 11 months ago
- Official Repo of ICLR 24 BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models☆11Updated last month
- ☆63Updated 10 months ago
- Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs"☆41Updated 7 months ago
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"☆37Updated 2 years ago
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models☆61Updated last week
- Code and data for our paper "Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark"…☆47Updated last year
- LLM Unlearning☆112Updated 11 months ago
- Accepted by IJCAI-24 Survey Track☆117Updated 3 weeks ago
- ☆23Updated 3 months ago
- Official repository of the paper: Who Wrote this Code? Watermarking for Code Generation (ACL 2024)☆23Updated 3 months ago
- [arXiv:2311.03191] "DeepInception: Hypnotize Large Language Model to Be Jailbreaker"☆109Updated 7 months ago
- A collection of automated evaluators for assessing jailbreak attempts.☆55Updated 2 months ago
- A resource repository for machine unlearning in large language models☆131Updated this week
- [ICLR 2024] Provable Robust Watermarking for AI-Generated Text☆25Updated 9 months ago
- Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).☆16Updated 2 months ago
- The lastest paper about detection of LLM-generated text and code☆195Updated last week
- An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)☆148Updated last year