☆69 · Sep 30, 2025 · Updated 7 months ago
Alternatives and similar repositories for LlavaGuard
Users that are interested in LlavaGuard are comparing it to the libraries listed below.
- [ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models. ☆87 · Jan 19, 2025 · Updated last year
- ☆35 · May 22, 2024 · Updated last year
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety ☆60 · Jul 21, 2025 · Updated 9 months ago
- Fine-tuning-free Shapley value (FreeShap) for instance attribution ☆14 · May 29, 2024 · Updated last year
- [COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and fur… ☆90 · May 9, 2025 · Updated 11 months ago
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking ☆12 · Jun 7, 2024 · Updated last year
- ☆25 · May 28, 2025 · Updated 11 months ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model ☆27 · Nov 25, 2025 · Updated 5 months ago
- [CVPR 2025] Official implementation for JOOD "Playing the Fool: Jailbreaking LLMs and Multimodal LLMs with Out-of-Distribution Strategy" ☆22 · Jun 11, 2025 · Updated 10 months ago
- [CVPR2025] T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation ☆33 · Jul 10, 2025 · Updated 9 months ago
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi… ☆73 · Feb 9, 2026 · Updated 2 months ago
- [NeurIPS 2023] Official PyTorch implementation for the paper "CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganog… ☆11 · Sep 28, 2023 · Updated 2 years ago
- [ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal… ☆81 · Jun 6, 2024 · Updated last year
- R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning, ECCV2022 [PyTorch Code] ☆14 · Sep 19, 2022 · Updated 3 years ago
- ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation. AAAI, 2025 ☆14 · Aug 25, 2025 · Updated 8 months ago
- [ICLR 2025] PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time" ☆31 · Jul 20, 2025 · Updated 9 months ago
- Several approaches using deep reinforcement learning to play Super Mario Bros. ☆10 · Aug 28, 2019 · Updated 6 years ago
- ☆14 · Jul 2, 2024 · Updated last year
- Code to break Llama Guard ☆32 · Dec 7, 2023 · Updated 2 years ago
- ☆160 · Aug 9, 2022 · Updated 3 years ago
- [ICML2024] DetKDS: Knowledge Distillation Search for Object Detectors ☆19 · Jul 11, 2024 · Updated last year
- Code for paper OmniSSR ☆25 · Apr 21, 2025 · Updated last year
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation ☆27 · Dec 12, 2025 · Updated 4 months ago
- [EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models. ☆50 · Aug 21, 2025 · Updated 8 months ago
- [CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models ☆385 · Jan 8, 2026 · Updated 3 months ago
- Official Implementation of Safe Latent Diffusion for Text2Image ☆97 · Apr 21, 2023 · Updated 3 years ago
- ☆27 · May 6, 2024 · Updated last year
- ☆34 · Jul 18, 2024 · Updated last year
- [NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming" ☆12 · Dec 20, 2024 · Updated last year
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities ☆30 · Apr 2, 2025 · Updated last year
- Experiments for our CLEAR benchmark of unlearning methods in a multimodal setup ☆23 · Aug 6, 2025 · Updated 8 months ago
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?" ☆50 · Jun 3, 2025 · Updated 11 months ago
- [TMLR'25] AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public saf… ☆55 · Nov 20, 2025 · Updated 5 months ago
- LangChain Agent ☆11 · Nov 25, 2025 · Updated 5 months ago
- Filipino multi-modal NLP dataset. Consists of 350k+ Filipino news articles and associated images ☆14 · Mar 11, 2025 · Updated last year
- ☆13 · Jun 4, 2025 · Updated 11 months ago
- A simple template for theoretical computer science assignments ☆11 · Sep 6, 2023 · Updated 2 years ago
- ☆21 · May 28, 2025 · Updated 11 months ago
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking ☆13 · Apr 11, 2025 · Updated last year