hzy312 / Awesome-LLM-Watermark
UP-TO-DATE LLM Watermark paper. π₯π₯π₯
β322Updated 2 months ago
Alternatives and similar repositories for Awesome-LLM-Watermark:
Users that are interested in Awesome-LLM-Watermark are comparing it to the libraries listed below
- A survey on harmful fine-tuning attack for large language modelβ135Updated last week
- The lastest paper about detection of LLM-generated text and codeβ247Updated last month
- β34Updated 6 months ago
- Code for watermarking language modelsβ75Updated 5 months ago
- A resource repository for machine unlearning in large language modelsβ307Updated last week
- Repository for Towards Codable Watermarking for Large Language Modelsβ34Updated last year
- LLM Unlearningβ141Updated last year
- Landing Page for TOFUβ113Updated this week
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.β84Updated 8 months ago
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"β84Updated 5 months ago
- π up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.β206Updated this week
- multi-bit language model watermarking (NAACL 24)β11Updated 4 months ago
- β553Updated 11 months ago
- Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs"β45Updated last year
- γACL 2024γ SALAD benchmark & MD-Judgeβ123Updated 2 months ago
- [ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modificationsβ68Updated 4 months ago
- β16Updated 10 months ago
- [ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of LLM Watermarksβ22Updated last year
- Official repository of the paper: Who Wrote this Code? Watermarking for Code Generation (ACL 2024)β32Updated 8 months ago
- Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024β31Updated 8 months ago
- β45Updated 7 months ago
- Repo for SemStamp (NAACL2024) and k-SemStamp (ACL2024)β17Updated 2 months ago
- Accepted by IJCAI-24 Survey Trackβ190Updated 5 months ago
- β109Updated 5 months ago
- β72Updated last week
- Accepted by ECCV 2024β94Updated 4 months ago
- This is the code repository of our submission: Understanding the Dark Side of LLMsβ Intrinsic Self-Correction.β55Updated last month
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuningβ88Updated 8 months ago
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decodingβ116Updated 6 months ago
- MarkLLM: An Open-Source Toolkit for LLM Watermarking.οΌEMNLP 2024 DemoοΌβ330Updated 2 weeks ago