A repository of Language Model Vulnerabilities and Exposures (LVEs).
☆112Mar 12, 2024Updated last year
Alternatives and similar repositories for lve
Users that are interested in lve are comparing it to the libraries listed below
Sorting:
- ☆10Oct 31, 2022Updated 3 years ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆454Feb 3, 2026Updated last month
- DEF CON 31 AI Village - LLMs: Loose Lips Multipliers☆10Aug 16, 2023Updated 2 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 8 months ago
- Code for our paper "Localizing Lying in Llama"☆13Apr 24, 2025Updated 10 months ago
- ☆12Dec 2, 2021Updated 4 years ago
- Security research helper for CLFS drivers☆16Sep 5, 2024Updated last year
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives☆70Feb 22, 2024Updated 2 years ago
- One Conference 2024☆111Oct 1, 2024Updated last year
- ☆36Dec 9, 2025Updated 2 months ago
- A framework for understanding the capabilities of automated detection methods at identifying classes of application security vulnerabilit…☆33Feb 13, 2026Updated 3 weeks ago
- MOCK: Optimizing Kernel Fuzzing Mutation with Context-aware Dependency☆20Dec 21, 2024Updated last year
- Test & Compare different Kubernetes security offerings on EKS, GKE and AKS☆40Aug 29, 2024Updated last year
- ☆25Mar 26, 2025Updated 11 months ago
- [NeurIPS'24] Protecting Your LLMs with Information Bottleneck☆25Nov 7, 2024Updated last year
- ☆32Dec 3, 2025Updated 3 months ago
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.☆116Jun 13, 2024Updated last year
- A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)☆52Jul 27, 2025Updated 7 months ago
- ☆20Dec 4, 2023Updated 2 years ago
- [NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents☆66Nov 14, 2025Updated 3 months ago
- [S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models☆19Feb 18, 2025Updated last year
- Seminar 2022☆23Updated this week
- Caterpillar is a security scanning library for AI agent skill files (e.g., Claude Code skills) for dangerous or malicious behavior☆33Feb 16, 2026Updated 2 weeks ago
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.☆90May 19, 2024Updated last year
- ☆30Jun 19, 2023Updated 2 years ago
- ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.☆93May 9, 2024Updated last year
- LLM Prompt Injection Detector☆1,426Aug 7, 2024Updated last year
- Code for shelLM tool☆57Jan 28, 2025Updated last year
- A simple script which implements different Cognito attacks such as Account Oracle or Priviledge Escalation☆109Feb 16, 2024Updated 2 years ago
- Official repo for the paper "Make Some Noise: Reliable and Efficient Single-Step Adversarial Training" (https://arxiv.org/abs/2202.01181)☆25Oct 17, 2022Updated 3 years ago
- A comprehensive database of Model Context Protocol vulnerabilities, security research, and exploits☆34Feb 16, 2026Updated 2 weeks ago
- MLOps Attack Toolkit☆30Aug 25, 2025Updated 6 months ago
- ☆105Dec 9, 2025Updated 2 months ago
- python package of rocm-smi-lib☆24Dec 15, 2025Updated 2 months ago
- Dropbox LLM Security research code and results☆255May 21, 2024Updated last year
- ☆71Feb 16, 2025Updated last year
- Critical Vulnerabilities in Trend Micro Deep Security Agent for Linux☆26Jan 19, 2022Updated 4 years ago
- Seamless AI Integration into Caido☆42Feb 23, 2026Updated last week