pipilurj/MLLM-protector

The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"

☆46

Alternatives and similar repositories for MLLM-protector

Users interested in MLLM-protector are comparing it to the libraries listed below.

pipilurj / bootstrapped-preference-optimization-BPO
Code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"
☆63 · Aug 23, 2024 · Updated last year
xyq7 / Human-Contribution-Measurement
☆13 · Jun 4, 2025 · Updated last year
W-Ted / UDC-NeRF
Official code for ICCV2023 paper: Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis
☆34 · Dec 27, 2023 · Updated 2 years ago
pipilurj / ROBOT
☆27 · Apr 11, 2023 · Updated 3 years ago
pipilurj / G-LLaVA
Official GitHub repo of G-LLaVA
☆154 · Feb 20, 2025 · Updated last year
sterzhang / PVIT
Official Repository of Personalized Visual Instruct Tuning
☆34 · Mar 6, 2025 · Updated last year
riejohnson / cfg-gan
CFG-GAN: Composite functional gradient learning of generative adversarial models
☆15 · Jul 9, 2020 · Updated 6 years ago
rt219 / LatentGuard
This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"
☆54 · Oct 24, 2024 · Updated last year
gyhdog99 / ECSO
ECSO (Make MLLM safe with neither training nor any external models!) (https://arxiv.org/abs/2403.09572)
☆37 · Nov 2, 2024 · Updated last year
EchoseChen / SPA-VL-RLHF
The reinforcement learning code for the SPA-VL dataset
☆48 · Jun 24, 2024 · Updated 2 years ago
SaFo-Lab / AdaShield
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…
☆73 · Feb 9, 2026 · Updated 5 months ago
DripNowhy / ETA
[ICLR 2025] PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time"
☆34 · Jul 20, 2025 · Updated last year
xyq7 / GradSafe
Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"
☆68 · Oct 27, 2024 · Updated last year
yjw1029 / Self-Reminder
Code for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" in NMI.
☆57 · Nov 13, 2023 · Updated 2 years ago
pipilurj / DynaFed
☆50 · Apr 1, 2023 · Updated 3 years ago
kangmintong / R-2-Guard
[ICLR 2025] Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning
☆23 · Jul 8, 2024 · Updated 2 years ago
SumilerGAO / SunGen
☆28 · Feb 26, 2023 · Updated 3 years ago
shizhediao / Black-Box-Prompt-Learning
Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"
☆59 · Sep 7, 2023 · Updated 2 years ago
itsvaibhav01 / Immune
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
☆28 · Jun 11, 2025 · Updated last year
sterzhang / image-textualization
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)
☆172 · Jul 30, 2024 · Updated last year
hanningzhang / prm
☆17 · Nov 3, 2024 · Updated last year
isXinLiu / MM-SafetyBench
Accepted by ECCV 2024
☆218 · Oct 15, 2024 · Updated last year
EngineeringSoftware / inlinetest
Tests that check correctness of a single statement
☆14 · Jun 3, 2026 · Updated last month
2003pro / ScaleBiO
This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting
☆25 · Jul 30, 2024 · Updated last year
prismformore / SDSEN
☆20 · May 26, 2020 · Updated 6 years ago
AI45Lab / VLSBench
[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
☆62 · Jul 21, 2025 · Updated last year
shizhediao / Post-Training-Data-Flywheel
We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.
☆66 · Oct 3, 2024 · Updated last year
longvideoagent / LongVideoAgent
☆120 · Apr 8, 2026 · Updated 3 months ago
yangcaoai / CoDA_NeurIPS2023
Official code for NeurIPS2023 paper CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detec…
☆223 · May 28, 2026 · Updated last month
kiaia / GIRAFFE
Extending context length of visual language models
☆12 · Dec 18, 2024 · Updated last year
TrustAIRLab / VoiceJailbreakAttack
Code for Voice Jailbreak Attacks Against GPT-4o.
☆38 · May 31, 2024 · Updated 2 years ago
thunxxx / MLLM-Jailbreak-evaluation-MMJ-Bench
☆80 · Mar 30, 2025 · Updated last year
lishangyu-hkust / OSVBench
(AAAI 2026) OSVBench, a new benchmark for evaluating Large Language Models (LLMs) in generating complete specification code pertaining to…
☆15 · May 13, 2025 · Updated last year
PandragonXIII / CIDER
This is the official repository for Cross-modality Information Check for Detecting Jailbreaking in Multimodal Large Language Models.
☆15 · Jan 16, 2025 · Updated last year
extreme-bert / extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “E…
☆268 · Mar 5, 2023 · Updated 3 years ago
CryptoAILab / FigStep
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆211 · Jun 26, 2025 · Updated last year
OptimalScale / DetGPT
☆786 · Aug 7, 2024 · Updated last year
zhongyingji / CVT-xRF
CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs (CVPR2024)
☆17 · Jun 14, 2024 · Updated 2 years ago
SilentView / EMCID
Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"
☆19 · Mar 21, 2024 · Updated 2 years ago