mignonjia / TS_watermark
☆11Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for TS_watermark
- Repository for Towards Codable Watermarking for Large Language Models☆29Updated last year
- [USENIX Security'24] REMARK-LLM: A robust and efficient watermarking framework for generative large language models☆17Updated 3 weeks ago
- ☆9Updated 2 years ago
- multi-bit language model watermarking (NAACL 24)☆11Updated last month
- ☆30Updated 3 months ago
- ☆37Updated 3 months ago
- 😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.☆129Updated last week
- ☆17Updated 3 weeks ago
- Official Repo of ICLR 24 BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models☆16Updated 3 months ago
- Accepted by ECCV 2024☆73Updated 3 weeks ago
- A survey on harmful fine-tuning attack for large language model☆70Updated this week
- Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024☆27Updated 5 months ago
- Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models"☆20Updated last year
- A curated list of trustworthy Generative AI papers. Daily updating...☆67Updated 2 months ago
- Repo for SemStamp (NAACL2024) and k-SemStamp (ACL2024)☆10Updated 3 months ago
- UP-TO-DATE LLM Watermark paper. 🔥🔥🔥☆286Updated 5 months ago
- ☆35Updated 9 months ago
- ☆21Updated 5 months ago
- Code for paper: "PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification", IEEE S&P 2024.☆28Updated 3 months ago
- [ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of LLM Watermarks☆18Updated 11 months ago
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆11Updated last year
- BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models☆72Updated 2 months ago
- ☆18Updated 4 months ago
- Jailbreaking Large Vision-language Models via Typographic Visual Prompts☆85Updated 6 months ago
- ☆25Updated 5 months ago
- ☆20Updated last year
- Code to generate NeuralExecs (prompt injection for LLMs)☆16Updated 3 months ago
- "In-Context Unlearning: Language Models as Few Shot Unlearners". Martin Pawelczyk, Seth Neel* and Himabindu Lakkaraju*; ICML 2024.☆15Updated last year
- Official repository of the paper: Who Wrote this Code? Watermarking for Code Generation (ACL 2024)☆29Updated 5 months ago
- [ICLR 2024] Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks☆12Updated 9 months ago