☆32Jan 26, 2025Updated last year
Alternatives and similar repositories for Code
Users that are interested in Code are comparing it to the libraries listed below
Sorting:
- ☆15Sep 17, 2024Updated last year
- [S&P 2026] SoK: Evaluating Jailbreak Guardrails for Large Language Models☆35Dec 17, 2025Updated 3 months ago
- ☆12Jul 21, 2023Updated 2 years ago
- Research Artifact of USENIX Security 2023 Paper: Precise and Generalized Robustness Certification for Neural Networks☆13Jun 20, 2023Updated 2 years ago
- Revisiting Cache Side-Channel Attacks in Deep Neural Networks Executables☆13Aug 27, 2024Updated last year
- Code for tracelet-level symbolic execution☆18Sep 18, 2022Updated 3 years ago
- CIPHERH: Automated Detection of Ciphertext Side-channel Vulnerabilities in Cryptographic Implementations☆13Dec 17, 2023Updated 2 years ago
- A test suite (a.k.a., dataset) with ~20k moral situations for understanding LLMs' behaviors.☆16May 5, 2023Updated 2 years ago
- OBsan: An Out-Of-Bound Sanitizer to Harden DNN Executables☆17Feb 28, 2023Updated 3 years ago
- Test equality between a black-box LLM API and a reference distribution☆12Oct 29, 2024Updated last year
- pytorch reimplementation for Detecting Adversarial Examples from Sensitivity Inconsistency of Spatial-Transform Domain☆11Oct 30, 2022Updated 3 years ago
- MCPSecBench: A Systematic Security Benchmark and Playground for Testing Model Context Protocols☆31Mar 4, 2026Updated 2 weeks ago
- This repository contains the evaluation code for the NDSS 2024 paper: MPCDIFF: Testing and Repairing MPC-Hardened Deep Learning Models.☆16Sep 5, 2023Updated 2 years ago
- Code release for "Idiosyncrasies in Large Language Models"☆55Jul 21, 2025Updated 7 months ago
- ☆27Sep 15, 2024Updated last year
- The official repository for guided jailbreak benchmark☆29Jul 28, 2025Updated 7 months ago
- Official implementation of ISSTA 2022 paper: MDPFuzz: Testing Models Solving Markov Decision Processes.☆24Dec 17, 2022Updated 3 years ago
- Code to generate NeuralExecs (prompt injection for LLMs)☆27Oct 5, 2025Updated 5 months ago
- Adversarial Examples Detection Benchmark☆17Dec 6, 2024Updated last year
- [NeurIPS'24] Protecting Your LLMs with Information Bottleneck☆27Nov 7, 2024Updated last year
- ☆14Jan 24, 2024Updated 2 years ago
- ☆11Dec 23, 2024Updated last year
- [SDM'23] ML4C: Seeing Causality Through Latent Vicinity☆14Nov 9, 2022Updated 3 years ago
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".☆55Updated this week
- A blog engine. Code for roselia.moe/blog☆10Feb 11, 2023Updated 3 years ago
- Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"☆66Oct 27, 2024Updated last year
- ☆48Sep 29, 2024Updated last year
- 使用rag来学习rag☆11Sep 6, 2024Updated last year
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"☆89Jul 24, 2025Updated 7 months ago
- [CVPR 2021] Official repository for "Prototype-supervised Adversarial Network for Targeted Attack of Deep Hashing"☆40Aug 28, 2022Updated 3 years ago
- Common MPC Pitfalls☆12Feb 14, 2026Updated last month
- ☆11Feb 22, 2024Updated 2 years ago
- ☆48Jun 16, 2025Updated 9 months ago
- Code for the paper "Explain Any Concept: Segment Anything Meets Concept-Based Explanation". Poster @ NeurIPS 2023☆46Dec 4, 2023Updated 2 years ago
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur…☆11Aug 24, 2022Updated 3 years ago
- Responsible Robotic Manipulation☆16Aug 31, 2025Updated 6 months ago
- ☆14Mar 9, 2025Updated last year
- Amoeba: Binary Code Diverisfication through Composite Software Diversification☆10Aug 3, 2017Updated 8 years ago
- Code for the paper: Fast and Private Inference of Deep Neural Networks by Co-designing Activation Functions☆11Mar 13, 2024Updated 2 years ago