SaFo-Lab/DoxBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SaFo-Lab/DoxBench)

SaFo-Lab / DoxBench

[ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"

☆26

Alternatives and similar repositories for DoxBench

Users that are interested in DoxBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tele-EVOL / TeleAI-Safety
View on GitHub
☆26Jan 5, 2026Updated 4 months ago
SaFo-Lab / AGrail4Agent
View on GitHub
[ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".
☆39Aug 4, 2025Updated 9 months ago
SaFo-Lab / AdaShield
View on GitHub
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…
☆73Feb 9, 2026Updated 3 months ago
SaFo-Lab / JailBreakV_28K
View on GitHub
[COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and fur…
☆92May 9, 2025Updated last year
SaFo-Lab / DRIFT
View on GitHub
[NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agen…
☆49Apr 19, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
SaFo-Lab / MetaAgent
View on GitHub
Offical Repository of MetaAgent Program
☆48Dec 2, 2025Updated 5 months ago
jiaxiaojunQAQ / FP-Better
View on GitHub
Code for Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks (TIFS2024)
☆13Mar 29, 2024Updated 2 years ago
SaFo-Lab / AgentDyn
View on GitHub
The official implementation of the paper "AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World …
☆50May 2, 2026Updated 2 weeks ago
SproutNan / AI-Safety_Benchmark
View on GitHub
The official repository for guided jailbreak benchmark
☆29Jul 28, 2025Updated 9 months ago
GuanlinLee / ART
View on GitHub
Official Code for ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users (NeurIPS 2024)
☆24Oct 23, 2024Updated last year
Sadcardation / ImageProtector
View on GitHub
Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj…
☆19Apr 17, 2026Updated last month
thinwayliu / Multimodal-Unlearnable-Examples
View on GitHub
The code for ACM MM2024 (Multimodal Unlearnable Examples: Protecting Data against Multimodal Contrastive Learning)
☆15Jul 18, 2024Updated last year
lvpeizhuo / HufuNet
View on GitHub
This is the source code for HufuNet. Our paper is accepted by the IEEE TDSC.
☆27Aug 21, 2023Updated 2 years ago
INTREBID / Awesome-MM-RAG
View on GitHub
This repository is for our survey paper: "A Comprehensive Survey on Multimodal RAG: All Combinations of Modalities as Input and Output"
☆49Nov 21, 2025Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ndb796 / CelebA-HQ-Face-Identity-and-Attributes-Recognition-PyTorch
View on GitHub
CelebA HQ Face Identity and Attributes Recognition using PyTorch
☆42Nov 3, 2023Updated 2 years ago
conditionWang / NTL
View on GitHub
This is the code of ICLR 2022 Oral paper 'Non-Transferable Learning: A New Approach for Model Ownership Verification and Applicability Au…
☆30Oct 22, 2023Updated 2 years ago
RUCAIBox / HADES
View on GitHub
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆35Oct 23, 2024Updated last year
Z-Zheng / dynamic_highres_poverty
View on GitHub
Dynamic, high-resolution poverty measurement in data-scarce environments
☆11Dec 8, 2024Updated last year
ShawnXYang / TIP-IM
View on GitHub
☆42Mar 11, 2022Updated 4 years ago
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆14Mar 7, 2026Updated 2 months ago
YancyKahn / CoA
View on GitHub
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
☆39Jan 17, 2025Updated last year
tmllab / 2025_ICLR_PiF
View on GitHub
☆40May 17, 2025Updated last year
Kiode / Text_Watermark
View on GitHub
Watermarking Text Generated by Black-Box Language Models
☆40Dec 9, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AmourWaltz / UAlign
View on GitHub
Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"
☆14Mar 25, 2025Updated last year
ShenzheZhu / JailDAM
View on GitHub
[COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model
☆27Nov 25, 2025Updated 5 months ago
jiaxiaojunQAQ / FGSM-PGI
View on GitHub
Code for Prior-Guided Adversarial Initialization for Fast Adversarial Training (ECCV2022)
☆28Nov 25, 2022Updated 3 years ago
thu-coai / Agent-SafetyBench
View on GitHub
☆136Aug 11, 2025Updated 9 months ago
Social-AI-Studio / MemeCraft
View on GitHub
Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"
☆12Jul 25, 2024Updated last year
Applied-Machine-Learning-Lab / MME-SID
View on GitHub
Code for MME-SID accepted to CIKM 2025 Full Research track.
☆28Oct 29, 2025Updated 6 months ago
YuvalSchwartz / llm-cloud-hunter
View on GitHub
☆17Feb 17, 2025Updated last year
jiaxiaojunQAQ / FGSM-LAW
View on GitHub
Revisiting and Exploring Efficient Fast Adversarial Training via LAW: Lipschitz Regularization and Auto Weight Averaging (TIFS2024)
☆37Jun 4, 2024Updated last year
Huang-yihao / Personalization-based_backdoor
View on GitHub
☆11Dec 18, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wenlinyao / HDFlow
View on GitHub
Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows
☆15Oct 4, 2024Updated last year
AlignmentResearch / scaling-poisoning
View on GitHub
☆16Nov 18, 2024Updated last year
kj3moraes / movieclip
View on GitHub
An experiment with movie scenes and contrastive learning
☆11Feb 1, 2025Updated last year
ReaJason / ReaTool
View on GitHub
创造自己的工具集，build for fun🎉
☆17May 13, 2023Updated 3 years ago
Testing4AI / DeepJudge
View on GitHub
Code release for DeepJudge (S&P'22)
☆52Mar 14, 2023Updated 3 years ago
AAAAAAsuka / llm_defends
View on GitHub
code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"
☆14Nov 17, 2023Updated 2 years ago
yuzhaouoe / SAE-based-representation-engineering
View on GitHub
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆80Jan 16, 2026Updated 4 months ago