NISPLab/JBShield

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NISPLab/JBShield)

NISPLab / JBShield

Code for USENIX Security 2025 paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation"

☆223

Alternatives and similar repositories for JBShield

Users that are interested in JBShield are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CongGroup / Poisoning-SSL-based-RS
View on GitHub
☆161Mar 31, 2025Updated last year
liqiao95 / DUCD
View on GitHub
☆143Aug 14, 2024Updated last year
wanrenmi / IFTTT_privacy_mining
View on GitHub
基于IFTTT平台的隐私挖掘工具
☆51Mar 27, 2025Updated last year
hututu2 / MediFHECloudPlatform
View on GitHub
本项目基于兼具加密与计算双重能力的全同态加密算法、利用微软开源库Microsoft-Seal而设计出的一套能够保护医疗数据的云计算系统。
☆62Mar 31, 2025Updated last year
DataAvailable / VULOC
View on GitHub
☆76May 23, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
IoTAccessControl / MCU-Token
View on GitHub
A secure IoT authentication framework based on hardware fingerprinting
☆157Mar 1, 2025Updated last year
CircuitMurderer / MPC-Platform
View on GitHub
MPC(Multi-Party Computation) all in one.
☆142Jan 26, 2026Updated 5 months ago
Yiruma96 / MLARandom-repo
View on GitHub
☆152Apr 28, 2025Updated last year
wchuanmu / RfTPM
View on GitHub
☆143Mar 31, 2025Updated last year
JR-account / SimdMSM
View on GitHub
SimdMSM: SIMD-accelerated Multi-Scalar Multiplication Framework for zkSNARKs
☆162Apr 21, 2025Updated last year
zllwhu / VTASChannel-server
View on GitHub
☆149Mar 31, 2025Updated last year
lunan0320 / Pioneer
View on GitHub
[开源软件发布]基于蓝牙的病毒追踪系统，采用BLE低功耗蓝牙，通过SM3加密认证保护用户数据安全性，提供包括Android开发，IOS开发，以及Java服务器开发的完整代码和直接可以运行的apk文件
☆150Jul 11, 2025Updated last year
Favorsiki / SecSHA3
View on GitHub
efficient anti side channel SHA3 algorithm software and hardware co-design
☆153Apr 21, 2025Updated last year
jayangcs / SketchSeger
View on GitHub
☆140Apr 1, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Garytop / rv32-pipeline-cpu
View on GitHub
WHU大二计算机设计流水线CPU设计课程作业
☆13Mar 11, 2025Updated last year
Cytmo / waf-prowler
View on GitHub
A rl-based waf bypass tool
☆246Mar 29, 2025Updated last year
powchan / SCCPU_sim_RV32I
View on GitHub
☆24Apr 3, 2025Updated last year
AngxiaoYue / ReQFlow
View on GitHub
[ICML 2025] 🧬 ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation
☆84Feb 12, 2026Updated 5 months ago
Waldenth / WHU-Operating-System-Concepts
View on GitHub
WHU-武汉大学-操作系统概念-课程资料与习题解答
☆33Mar 22, 2021Updated 5 years ago
powchan / PLCPU_sim_RV32I
View on GitHub
☆17Apr 3, 2025Updated last year
xinyangli / orange-os
View on GitHub
Implementation of an X86 mini OS from scratch. Reference: https://github.com/yyu/osfs00
☆11Jan 9, 2023Updated 3 years ago
YihanWang617 / llm-jailbreaking-defense
View on GitHub
A lightweight library for large laguage model (LLM) jailbreaking defense.
☆61Sep 11, 2025Updated 10 months ago
pinczakko / nsa_bios_backdoor_articles
View on GitHub
PDF files of my articles on NSA BIOS backdoor
☆25Nov 29, 2017Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
PKU-ML / PAT
View on GitHub
Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"
☆22May 6, 2025Updated last year
shenyizg / NewAdversarialAttackPaper
View on GitHub
A list of recent adversarial attack and defense papers (including those on large language models)
☆44Mar 18, 2026Updated 4 months ago
whuAdv / AdvPattern
View on GitHub
☆10Mar 6, 2020Updated 6 years ago
ZJUICSR / AIcert
View on GitHub
☆228Aug 17, 2025Updated 11 months ago
uw-nsl / SafeDecoding
View on GitHub
Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
☆154Jul 19, 2024Updated 2 years ago
thu-coai / JailbreakDefense_GoalPriority
View on GitHub
[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
☆29Jul 9, 2024Updated 2 years ago
xirui-li / DrAttack
View on GitHub
Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
☆68Aug 25, 2024Updated last year
leigest519 / HiddenDetect
View on GitHub
ACL 2025 (Main) HiddenDetect: Detecting Jailbreak Attacks against Multimodal Large Language Models via Monitoring Hidden States
☆165Jun 8, 2025Updated last year
kztakemoto / simbaja
View on GitHub
All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks
☆17Apr 24, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
xyq7 / GradSafe
View on GitHub
Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"
☆68Oct 27, 2024Updated last year
Xinghui-Wu / KENKU
View on GitHub
KENKU: Towards Efficient and Stealthy Black-box Adversarial Attacks against ASR Systems
☆19Oct 3, 2023Updated 2 years ago
tristartom / sgx-emulator
View on GitHub
An Emulator and SDK for Intel SGX extension
☆32Mar 6, 2017Updated 9 years ago
chujiezheng / LLM-Safeguard
View on GitHub
Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"
☆108May 20, 2025Updated last year
mignonjia / TS_watermark
View on GitHub
☆16May 11, 2025Updated last year
SheltonLiu-N / AutoDAN
View on GitHub
[ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…
☆453Jan 22, 2025Updated last year
Waldenth / WHU-Cryptography-experiment
View on GitHub
WHU-武汉大学-国家网络安全学院-信息安全-密码学实验
☆13Mar 24, 2021Updated 5 years ago