Code for USENIX Security 2025 paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation"
☆220Jan 27, 2026Updated last month
Alternatives and similar repositories for JBShield
Users that are interested in JBShield are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆164Mar 31, 2025Updated 11 months ago
- ☆145Aug 14, 2024Updated last year
- High-efficiency Secure Two Party Computation on GPU☆172Apr 1, 2025Updated 11 months ago
- The implementation of our AAAI 2024 paper "Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought"☆196Apr 5, 2025Updated 11 months ago
- 基于IFTTT平台的隐私挖掘工具☆51Mar 27, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 本项目基于兼具加密与计算双重能力的全同态加密算法、利用微软开源库Microsoft-Seal而设计出的一套能够保护医疗数据的云计算系统。☆62Mar 31, 2025Updated 11 months ago
- ☆73May 23, 2025Updated 10 months ago
- A secure IoT authentication framework based on hardware fingerprinting☆156Mar 1, 2025Updated last year
- ☆142Mar 31, 2025Updated 11 months ago
- ☆145Mar 31, 2025Updated 11 months ago
- ☆143Mar 2, 2025Updated last year
- SimdMSM: SIMD-accelerated Multi-Scalar Multiplication Framework for zkSNARKs☆162Apr 21, 2025Updated 11 months ago
- efficient anti side channel SHA3 algorithm software and hardware co-design☆154Apr 21, 2025Updated 11 months ago
- [开源软件发布]基于蓝牙的病毒追踪系统,采用BLE低功耗蓝牙,通过SM3加密认证保护用户数据安全性,提供包括Android开发,IOS开发,以及Java服务器开发的完整代码和直接可以运行的apk文件☆150Jul 11, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for CVPR 2024 "Revisiting Adversarial Training under Long-Tailed Distributions".☆157Mar 1, 2025Updated last year
- datacon比赛2024年漏洞分析赛道解题框架与运行镜像压缩包☆182Jun 10, 2025Updated 9 months ago
- WHU大二 计算机设计 流水线CPU设计 课程作业☆13Mar 11, 2025Updated last year
- A rl-based waf bypass tool☆245Mar 29, 2025Updated 11 months ago
- 国密算法的纯 Python 实现.☆303Jan 11, 2026Updated 2 months ago
- [ICML 2025] 🧬 ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation☆81Feb 12, 2026Updated last month
- 武汉大学课程资料整理-WHU课代表计划☆1,586Mar 11, 2026Updated 2 weeks ago
- ☆16Apr 3, 2025Updated 11 months ago
- Implementation of an X86 mini OS from scratch. Reference: https://github.com/yyu/osfs00☆11Jan 9, 2023Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"☆22May 6, 2025Updated 10 months ago
- A lightweight library for large laguage model (LLM) jailbreaking defense.☆61Sep 11, 2025Updated 6 months ago
- A list of recent adversarial attack and defense papers (including those on large language models)☆45Mar 18, 2026Updated last week
- Runflow is a tool to define and run workflows.☆10Jul 13, 2021Updated 4 years ago
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding☆151Jul 19, 2024Updated last year
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization☆29Jul 9, 2024Updated last year
- ☆226Aug 17, 2025Updated 7 months ago
- Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"☆66Oct 27, 2024Updated last year
- Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers☆66Aug 25, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"☆107May 20, 2025Updated 10 months ago
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Apr 24, 2024Updated last year
- KENKU: Towards Efficient and Stealthy Black-box Adversarial Attacks against ASR Systems☆19Oct 3, 2023Updated 2 years ago
- ☆12Nov 10, 2020Updated 5 years ago
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…☆434Jan 22, 2025Updated last year
- ☆76Mar 30, 2025Updated 11 months ago
- ☆39May 17, 2025Updated 10 months ago