shiningrain / JailGuard
☆23 Updated 7 months ago
Alternatives and similar repositories for JailGuard
Users who are interested in JailGuard are comparing it to the repositories listed below.
- Code for NeurIPS 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models" ☆56 Updated 9 months ago
- ☆53 Updated last year
- Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107 ☆17 Updated last year
- ☆63 Updated 7 months ago
- [CCS-LAMPS'24] LLM IP Protection Against Model Merging ☆15 Updated last year
- Reconstructive Neuron Pruning for Backdoor Defense (ICML 2023) ☆39 Updated last year
- Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning" ☆20 Updated 5 months ago
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning