leondz / lm_risk_cards
Risks and targets for assessing LLMs & LLM vulnerabilities
☆32 · Updated last year
Alternatives and similar repositories for lm_risk_cards
Users interested in lm_risk_cards are comparing it to the repositories listed below.
- A repository of Language Model Vulnerabilities and Exposures (LVEs). ☆113 · Updated last year
- A collection of prompt injection mitigation techniques. ☆23 · Updated last year
- A benchmark for prompt injection detection systems. ☆124 · Updated 3 weeks ago
- LLM security and privacy. ☆49 · Updated 9 months ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents. ☆226 · Updated last week
- ⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs. ☆402 · Updated last year
- Implementation of the BEAST adversarial attack for language models (ICML 2024). ☆90 · Updated last year
- This repository provides a benchmark for prompt injection attacks and defenses. ☆255 · Updated 3 weeks ago
- Whispers in the Machine: Confidentiality in Agentic Systems. ☆39 · Updated 2 months ago
- Top 10 for Agentic AI (AI Agent Security); serves as the core for OWASP and CSA red-teaming work. ☆124 · Updated last month
- Dropbox LLM Security research code and results. ☆231 · Updated last year
- ATLAS tactics, techniques, and case studies data. ☆77 · Updated 3 months ago
- Universal Robustness Evaluation Toolkit (for Evasion). ☆31 · Updated 3 months ago
- A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks. ☆73 · Updated last year
- 🧠 LLMFuzzer - Fuzzing Framework for Large Language Models 🧠: the first open-source fuzzing framework specifically designed … ☆303 · Updated last year
- Secure Jupyter Notebooks and Experimentation Environment. ☆78 · Updated 6 months ago
- PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a… ☆401 · Updated last year
- LLM | Security | Operations in one GitHub repo with good links and pictures. ☆35 · Updated 7 months ago
- Every practical and proposed defense against prompt injection. ☆503 · Updated 5 months ago
- The fastest Trust Layer for AI Agents. ☆140 · Updated 2 months ago
- A prompt injection game to collect data for robust ML research. ☆62 · Updated 6 months ago
- Codebase of https://arxiv.org/abs/2410.14923 ☆49 · Updated 9 months ago
- Papers about red-teaming LLMs and multimodal models. ☆131 · Updated 2 months ago
- CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on… ☆49 · Updated last week