0din-ai / 0Din-Curated-Monthly-White-Papers
This repository curates a collection of monthly white papers focused on the latest LLM attacks and defenses.
☆22 Updated 6 months ago
Alternatives and similar repositories for 0Din-Curated-Monthly-White-Papers:
Users who are interested in 0Din-Curated-Monthly-White-Papers are comparing it to the repositories listed below:
- A collection of prompt injection mitigation techniques. ☆22 Updated last year
- ☆34 Updated 3 months ago
- Payloads for Attacking Large Language Models ☆81 Updated 9 months ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs). ☆109 Updated last year
- Top 10 for Agentic AI (AI Agent Security) - Pre-release version ☆84 Updated last month
- Prompt Injections Everywhere ☆118 Updated 8 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation ☆106 Updated last year
- future-proof vulnerability detection benchmark, based on CVEs in open-source repos ☆52 Updated this week
- https://arxiv.org/abs/2412.02776 ☆52 Updated 4 months ago
- ☆13 Updated 4 months ago
- HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. … ☆47 Updated last year
- XBOW Validation Benchmarks ☆84 Updated 7 months ago
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs) ☆106 Updated 4 months ago
- A collection of awesome resources related to AI security ☆206 Updated last week
- Codebase of https://arxiv.org/abs/2410.14923 ☆46 Updated 6 months ago
- An LLM explicitly designed for getting hacked ☆147 Updated last year
- An interactive CLI application for interacting with authenticated Jupyter instances. ☆53 Updated last year
- This repository contains various attacks against Large Language Models. ☆104 Updated 11 months ago
- Delving into the Realm of LLM Security: An Exploration of Offensive and Defensive Tools, Unveiling Their Present Capabilities. ☆162 Updated last year
- LLMBUS red team tool 🚍 ☆36 Updated 2 months ago
- A list of curated resources for people interested in AI Red Teaming, Jailbreaking, and Prompt Injection ☆101 Updated 2 weeks ago
- A guide to LLM hacking: fundamentals, prompt injection, offense, and defense ☆148 Updated 2 years ago
- Dropbox LLM Security research code and results ☆222 Updated 11 months ago
- ☆60 Updated this week
- A research project to add some brrrrrr to Burp ☆155 Updated 2 months ago
- LLM Testing Findings Templates ☆71 Updated last year
- Red-Teaming Language Models with DSPy ☆183 Updated 2 months ago
- ☆64 Updated 3 months ago
- A knowledge source about TTPs used to target GenAI-based systems, copilots and agents ☆34 Updated last month
- source code for the offsecml framework ☆38 Updated 10 months ago