ml-research / LlavaGuard
☆33Updated 5 months ago
Alternatives and similar repositories for LlavaGuard:
Users that are interested in LlavaGuard are comparing it to the libraries listed below
- [ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.☆52Updated last week
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"☆76Updated last year
- ☆15Updated 11 months ago
- Code for T-MARS data filtering☆35Updated last year
- ☆27Updated last year
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆31Updated 2 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View☆46Updated 3 months ago
- Codebase for decoding compressed trust.☆22Updated 8 months ago
- [ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …☆21Updated 3 months ago
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"☆32Updated 9 months ago