DSN-2024/DSN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DSN-2024/DSN)

DSN-2024 / DSN

DSN jailbreak Attack & Evaluation Ensemble

☆17

Alternatives and similar repositories for DSN

Users that are interested in DSN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TrustAIRLab / JailbreakRadar
View on GitHub
☆87Jun 8, 2025Updated 11 months ago
itsvaibhav01 / Immune
View on GitHub
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
☆28Jun 11, 2025Updated 11 months ago
ledllm / ledllm
View on GitHub
☆25Jun 16, 2024Updated last year
qihangGH / probabilistic_melt_pool_model
View on GitHub
[TASE 2024] Official implementation of the paper "Probabilistic Data-Driven Modeling of a Melt Pool in Laser Powder Bed Fusion Additive M…
☆12Jul 14, 2025Updated 10 months ago
quicksviewer / quicksviewer
View on GitHub
☆19Jun 29, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
angel-ayala / gym-webots-drone
View on GitHub
Webots scene gym environment for drone navigation tasks methods
☆13Sep 2, 2025Updated 8 months ago
qihangGH / IMRC
View on GitHub
[AAAI2024] Official implementation of Evaluate Geometry of Radiance Fields with Low-frequency Color Prior
☆17Jun 25, 2024Updated last year
listen0425 / Safety-Layers
View on GitHub
code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)
☆24Apr 26, 2025Updated last year
jackyyang9 / MLPHand
View on GitHub
☆19Jan 7, 2026Updated 4 months ago
SongZihui-sudo / Mechanical-design-solver
View on GitHub
一个机械设计课设的计算器，可以计算出包括电动机，传动装置，V带轮，齿轮，轴，轴承的几何或者力，运动学参数数值。
☆19Jan 5, 2023Updated 3 years ago
CHATS-lab / LLMs_Encode_Harmfulness_Refusal_Separately
View on GitHub
☆32Dec 14, 2025Updated 5 months ago
SproutNan / AI-Safety_Benchmark
View on GitHub
The official repository for guided jailbreak benchmark
☆29Jul 28, 2025Updated 9 months ago
yuanc3 / DATE
View on GitHub
Use 2 lines to empower absolute time awareness for Qwen2.5VL's MRoPE
☆29Sep 20, 2025Updated 7 months ago
Redempt1onzzZZ / FUTURE
View on GitHub
[ICSE 2025] The Seeds of the FUTURE Sprout from History: Fuzzing for Unveiling Vulnerabilities in Prospective Deep-Learning Libraries (AC…
☆22Dec 22, 2025Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mathpopo / Llama2-Chinese
View on GitHub
Llama中文社区，最好的中文Llama大模型，完全开源可商用
☆12Aug 5, 2023Updated 2 years ago
LaVi-Lab / Rethink_CoT_Video
View on GitHub
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆20Dec 14, 2025Updated 5 months ago
percent4 / llama-2-multiple-choice-mrc
View on GitHub
本项目采用Firefly模型训练框架，使用LLAMA-2模型对多项选择阅读理解任务（Multiple Choice MRC）进行微调，取得了显著的进步。
☆11Sep 16, 2023Updated 2 years ago
showlab / FocusUI
View on GitHub
[CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
☆32Feb 10, 2026Updated 3 months ago
kaistAI / How-Well-Do-LLMs-Truly-Ground
View on GitHub
☆11Sep 19, 2025Updated 8 months ago
changwxx / ShanghaiTech-poster-template
View on GitHub
A self-made NeurIPS poster template, infused with the unique design style of ShanghaiTech.
☆16Dec 26, 2023Updated 2 years ago
Infini-AI-Lab / M2PO
View on GitHub
☆30Oct 8, 2025Updated 7 months ago
casperllm / CASPER
View on GitHub
☆16Apr 27, 2024Updated 2 years ago
ydyjya / LLM-IHS-Explanation
View on GitHub
☆60Jun 13, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
feruxmax / meltdown
View on GitHub
Meltdown/Spectre experiments
☆54Jan 5, 2018Updated 8 years ago
adithya-s-k / MoLE
View on GitHub
Mixture of Lora Experts
☆11Apr 7, 2024Updated 2 years ago
Yiwei98 / ESC
View on GitHub
☆14Jul 17, 2025Updated 10 months ago
yuplin2333 / representation-space-jailbreak
View on GitHub
Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…
☆24Jul 26, 2024Updated last year
Hongyang-Du / VideoGPA
View on GitHub
[ICML'26] VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.
☆54May 9, 2026Updated last week
thunxxx / MLLM-Jailbreak-evaluation-MMJ-Bench
View on GitHub
☆77Mar 30, 2025Updated last year
facebookresearch / HalluLens
View on GitHub
Codebase for LLM Textual Hallucination Benchmark
☆80Apr 25, 2025Updated last year
princeton-pli / QRHead
View on GitHub
QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking
☆38Jan 20, 2026Updated 3 months ago
NLie2 / what_features_jailbreak_LLMs
View on GitHub
☆18Mar 30, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MaTengSYSU / HIMRD-jailbreak
View on GitHub
Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"
☆17Aug 7, 2025Updated 9 months ago
eric-ai-lab / MSSBench
View on GitHub
[ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"
☆35Jun 23, 2025Updated 10 months ago
RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
View on GitHub
Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
☆36Jun 1, 2025Updated 11 months ago
SaFo-Lab / AdaShield
View on GitHub
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…
☆73Feb 9, 2026Updated 3 months ago
caojiantao / love-note
View on GitHub
恋爱记事本，一款轻便记录情侣日常生活的小程序。
☆19Dec 28, 2023Updated 2 years ago
huaixuheqing / VPPO-RL
View on GitHub
[ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"
☆62Apr 3, 2026Updated last month
paul-rottger / xstest
View on GitHub
Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
☆133Feb 24, 2025Updated last year