Code for USENIX Security 2025 paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation"
☆219Jan 27, 2026Updated 3 months ago
Alternatives and similar repositories for JBShield
Users that are interested in JBShield are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆163Mar 31, 2025Updated last year
- ☆144Aug 14, 2024Updated last year
- 基于IFTTT平台的隐私挖掘工具☆51Mar 27, 2025Updated last year
- 本项目基于兼具加密与计算双重能力的全同态加密算法、利用微软开源库Microsoft-Seal而设计出的一套能够保护医疗数据的云计算系统。☆62Mar 31, 2025Updated last year
- ☆75May 23, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆23Mar 13, 2025Updated last year
- A secure IoT authentication framework based on hardware fingerprinting☆157Mar 1, 2025Updated last year
- MPC(Multi-Party Computation) all in one.☆142Jan 26, 2026Updated 3 months ago
- ☆152Apr 28, 2025Updated last year
- ☆143Mar 31, 2025Updated last year
- ☆145Mar 31, 2025Updated last year
- ☆143Mar 2, 2025Updated last year
- efficient anti side channel SHA3 algorithm software and hardware co-design☆154Apr 21, 2025Updated last year
- ☆149Mar 31, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆140Apr 1, 2025Updated last year
- datacon比赛2024年漏洞分析赛道解题框架与运行镜像压缩包☆182Jun 10, 2025Updated 10 months ago
- WHU大二 计算机设计 流水线CPU设计 课程作业☆14Mar 11, 2025Updated last year
- 国密算法的纯 Python 实现.☆307Jan 11, 2026Updated 3 months ago
- [ICML 2025] 🧬 ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation☆84Feb 12, 2026Updated 2 months ago
- 武汉大学课程资料整理-WHU课代表计划☆1,613Apr 6, 2026Updated 3 weeks ago
- Implementation of an X86 mini OS from scratch. Reference: https://github.com/yyu/osfs00☆11Jan 9, 2023Updated 3 years ago
- This repo demonstrates the Return-to-Non-Secure (ret2ns) vulnerability on ARM Cortex-M TrustZone. It contains the attack and defense demo…☆34Oct 30, 2025Updated 6 months ago
- Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"☆22May 6, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ARM TrustZone Audit: Securing vs. Non-Securing Memory Separation☆25Mar 1, 2025Updated last year
- A lightweight library for large laguage model (LLM) jailbreaking defense.☆60Sep 11, 2025Updated 7 months ago
- A list of recent adversarial attack and defense papers (including those on large language models)☆45Mar 18, 2026Updated last month
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding☆152Jul 19, 2024Updated last year
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization☆29Jul 9, 2024Updated last year
- ☆227Aug 17, 2025Updated 8 months ago
- Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers☆66Aug 25, 2024Updated last year
- Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"☆66Oct 27, 2024Updated last year
- Runflow is a tool to define and run workflows.☆11Jul 13, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"☆108May 20, 2025Updated 11 months ago
- ☆13Jan 28, 2020Updated 6 years ago
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Apr 24, 2024Updated 2 years ago
- ☆17May 11, 2025Updated 11 months ago
- An Emulator and SDK for Intel SGX extension☆32Mar 6, 2017Updated 9 years ago
- ☆12Nov 10, 2020Updated 5 years ago
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…☆440Jan 22, 2025Updated last year