Twilight92z / Quantize-Watermark
☆20Updated last year
Alternatives and similar repositories for Quantize-Watermark:
Users that are interested in Quantize-Watermark are comparing it to the libraries listed below
- Codebase for decoding compressed trust.☆22Updated 8 months ago
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)☆53Updated last week
- Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]☆60Updated 3 months ago
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆15Updated 8 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆35Updated 2 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆63Updated 6 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆88Updated 7 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆34Updated 3 months ago
- This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)☆32Updated 2 months ago
- ☆16Updated last month
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"☆53Updated 3 months ago
- ☆28Updated 3 months ago
- Awesome-Low-Rank-Adaptation☆62Updated 3 months ago
- ☆45Updated 6 months ago
- ☆13Updated 3 months ago
- Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks☆22Updated 6 months ago
- The official code of the paper "A Closer Look at Machine Unlearning for Large Language Models".☆20Updated last month
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"☆83Updated 4 months ago
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Updated last year
- A block pruning framework for LLMs.☆15Updated 6 months ago