☆70Sep 30, 2025Updated 8 months ago
Alternatives and similar repositories for LlavaGuard
Users that are interested in LlavaGuard are comparing it to the libraries listed below.
- ☆35May 22, 2024Updated 2 years ago
- Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Model☆17Feb 16, 2025Updated last year
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆62Jul 21, 2025Updated 10 months ago
- Fine-tuning-free Shapley value (FreeShap) for instance attribution☆14May 29, 2024Updated 2 years ago
- [CVPR 2025] Official implementation for "Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbre…☆61Jul 5, 2025Updated 11 months ago
- [COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and fur…☆94May 9, 2025Updated last year
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated 2 years ago
- ☆25May 28, 2025Updated last year
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model☆26Nov 25, 2025Updated 6 months ago
- [CVPR 2025] Official implementation for JOOD "Playing the Fool: Jailbreaking LLMs and Multimodal LLMs with Out-of-Distribution Strategy"☆21Jun 11, 2025Updated last year
- [CVPR2025] T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation☆33Jul 10, 2025Updated 11 months ago
- [NeurIPS 2023] Official PyTorch implementation for the paper "CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganog…☆11Sep 28, 2023Updated 2 years ago
- ☆32Mar 29, 2025Updated last year
- [ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…☆86Jun 6, 2024Updated 2 years ago
- R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning, ECCV2022 [PyTorch Code]☆14Sep 19, 2022Updated 3 years ago
- [IROS'25] COCMT☆12Aug 14, 2025Updated 10 months ago
- Facial-Expression Recognition with Deep Neural Networks☆10Mar 6, 2016Updated 10 years ago
- Reduction Server in Rust☆14Apr 9, 2024Updated 2 years ago
- [ICLR 2025] PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time"☆34Jul 20, 2025Updated 10 months ago
- ☆15Jul 2, 2024Updated last year
- Code to break Llama Guard☆32Dec 7, 2023Updated 2 years ago
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- ☆161Aug 9, 2022Updated 3 years ago
- ☆19Apr 7, 2025Updated last year
- Code for paper OmniSSR☆25Apr 21, 2025Updated last year
- [EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.☆51Aug 21, 2025Updated 9 months ago
- ☆79Mar 30, 2025Updated last year
- [CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models☆385Jan 8, 2026Updated 5 months ago
- Official Implementation of Safe Latent Diffusion for Text2Image☆98Apr 21, 2023Updated 3 years ago
- Data collection from Moltbook for research☆53May 15, 2026Updated 3 weeks ago
- ☆28May 6, 2024Updated 2 years ago
- This is the official repository for paper: cross-modal information flow in multimodal large language models☆44May 21, 2025Updated last year
- ☆35Jul 18, 2024Updated last year
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆30Apr 2, 2025Updated last year
- Experiments for our CLEAR benchmark of unlearning methods in a multimodal setup☆23Aug 6, 2025Updated 10 months ago
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆51Jun 3, 2025Updated last year
- [TMLR'25] AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public saf…☆55Nov 20, 2025Updated 6 months ago
- ☆200Apr 7, 2025Updated last year
- Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings☆21Sep 1, 2025Updated 9 months ago