A repository of Language Model Vulnerabilities and Exposures (LVEs).
☆112Mar 12, 2024Updated 2 years ago
Alternatives and similar repositories for lve
Users that are interested in lve are comparing it to the libraries listed below.
- Guardrails for secure and robust agent development☆410Jan 12, 2026Updated 3 months ago
- A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)☆54Jul 27, 2025Updated 8 months ago
- ☆19Jun 18, 2025Updated 9 months ago
- Explore AI Supply Chain Risk with the AI Risk Database☆72May 8, 2024Updated last year
- ☆10Oct 31, 2022Updated 3 years ago
- [ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs☆13Jun 20, 2025Updated 9 months ago
- This repository contains code and data of the paper **On the Limitations of Continual Learning for Malware Classification**, accepted to …☆19Dec 29, 2023Updated 2 years ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆52Jan 12, 2026Updated 3 months ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆527Mar 30, 2026Updated 2 weeks ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Aug 11, 2022Updated 3 years ago
- Payloads for Attacking Large Language Models☆131Jan 13, 2026Updated 3 months ago
- Code for our paper "Localizing Lying in Llama"☆13Apr 24, 2025Updated 11 months ago
- Code used to run the platform for the LLM CTF colocated with SaTML 2024☆28Mar 20, 2024Updated 2 years ago
- ☆54Feb 24, 2026Updated last month
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆12Jun 12, 2023Updated 2 years ago
- Scripts that I've written that others may find useful☆13Aug 17, 2022Updated 3 years ago
- ☆41Dec 9, 2025Updated 4 months ago
- ☆12Dec 2, 2021Updated 4 years ago
- ☆20Dec 4, 2023Updated 2 years ago
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives☆70Feb 22, 2024Updated 2 years ago
- Binary Ninja Plugin for Generating Callgraphs☆17Jun 17, 2025Updated 10 months ago
- MOCK: Optimizing Kernel Fuzzing Mutation with Context-aware Dependency☆22Dec 21, 2024Updated last year
- Code for the paper Explanation-Guided Backdoor Poisoning Attacks Against Malware Classifiers☆60Apr 29, 2022Updated 3 years ago
- CCS 2023 | Explainable malware and vulnerability detection with XAI in paper "FINER: Enhancing State-of-the-art Classifiers with Feature …☆11Aug 20, 2024Updated last year
- [ICML 2025] UDora: A Unified Red Teaming Framework against LLM Agents☆33Jun 24, 2025Updated 9 months ago
- Linux #rootkit and #malware revealer☆31Aug 1, 2024Updated last year
- ☆25Mar 26, 2025Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated last year
- AI Robustness Evaluation System☆38Updated this week
- [S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models☆19Feb 18, 2025Updated last year
- [NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents☆70Nov 14, 2025Updated 5 months ago
- ☆73Feb 16, 2025Updated last year
- Test & Compare different Kubernetes security offerings on EKS, GKE and AKS☆40Aug 29, 2024Updated last year
- TabLeak: Tabular Data Leakage in Federated Learning☆17Jul 4, 2024Updated last year
- Security research helper for CLFS drivers☆16Sep 5, 2024Updated last year
- Risks and targets for assessing LLMs & LLM vulnerabilities☆34May 27, 2024Updated last year
- The backend behind the LLM-Perf Leaderboard☆11May 5, 2024Updated last year
- PFI: Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents☆27Mar 26, 2025Updated last year
- Constrained Decoding of Diffusion LLMs with Context-Free Grammars.☆44Dec 17, 2025Updated 3 months ago