AI45Lab / X-BoundaryLinks
The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability"
☆35Updated 5 months ago
Alternatives and similar repositories for X-Boundary
Users that are interested in X-Boundary are comparing it to the libraries listed below
Sorting:
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆49Updated last month
- Accepted by ECCV 2024☆149Updated 10 months ago
- Accepted by IJCAI-24 Survey Track☆212Updated last year
- Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as …☆69Updated last week
- ☆60Updated 5 months ago
- ☆51Updated last year
- Official repository for "Safety in Large Reasoning Models: A Survey" - Exploring safety risks, attacks, and defenses for Large Reasoning …☆66Updated last week
- ☆135Updated 6 months ago
- Official repository of RiOSWorld☆34Updated last month
- Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"☆68Updated 6 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆121Updated last month
- The reinforcement learning codes for dataset SPA-VL☆37Updated last year