microsoft / gandalf_vs_gandalfLinks

Turning Gandalf against itself. Use LLMs to automate playing Lakera Gandalf challenge without needing to set up an account with a platform provider.

☆28

Alternatives and similar repositories for gandalf_vs_gandalf

Users that are interested in gandalf_vs_gandalf are comparing it to the libraries listed below

Sorting:

kenhuangus / Top-Threats-for-AI-Agents
☆52Updated 2 months ago
mrwadams / honeyagents
HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. …
☆53Updated last year
prompt-security / ps-fuzz
Make your GenAI Apps Safe & Secure Test & harden your system prompt
☆519Updated last month
dropbox / llm-security
Dropbox LLM Security research code and results
☆228Updated last year
usnistgov / dioptra
Test Software for the Characterization of AI Technologies
☆260Updated this week
corca-ai / LLMFuzzAgent
[Corca / ML] Automatically solved Gandalf AI with LLM
☆50Updated 2 years ago
haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆202Updated 5 months ago
deadbits / vigil-llm
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
☆396Updated last year
lakeraai / pint-benchmark
A benchmark for prompt injection detection systems.
☆122Updated 2 months ago
StavC / ComPromptMized
ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications
☆203Updated last year
precize / Agentic-AI-Top10-Vulnerability
Top 10 for Agentic AI (AI Agent Security) serves as the core for OWASP and CSA Red teaming work
☆119Updated last month
leondz / lm_risk_cards
Risks and targets for assessing LLMs & LLM vulnerabilities
☆31Updated last year
kaplanlior / oss-llm-security
Curated list of Open Source project focused on LLM security
☆50Updated 8 months ago
ZenGuard-AI / fast-llm-security-guardrails
The fastest Trust Layer for AI Agents
☆138Updated last month
aiverify-foundation / moonshot
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
☆258Updated this week
pasquini-dario / project_mantis
Project Mantis: Hacking Back the AI-Hacker; Prompt Injection as a Defense Against LLM-driven Cyberattacks
☆71Updated last month
ZySec-AI / project-zysec
The project serves as a strategic advisory tool, capitalizing on the ZySec series of AI models to amplify the capabilities of security pr…
☆53Updated last year
lve-org / lve
A repository of Language Model Vulnerabilities and Exposures (LVEs).
☆112Updated last year
OWASP / www-project-ai-security-and-privacy-guide
OWASP Foundation Web Respository
☆284Updated this week
utkusen / promptmap
a prompt injection scanner for custom LLM applications
☆835Updated 4 months ago
cybershujin / Threat-Actors-use-of-Artifical-Intelligence
☆254Updated 6 months ago
mitre-atlas / atlas-data
ATLAS tactics, techniques, and case studies data
☆77Updated 2 months ago
tldrsec / prompt-injection-defenses
Every practical and proposed defense against prompt injection.
☆495Updated 4 months ago
jthack / PIPE
Prompt Injection Primer for Engineers
☆443Updated last year
safellama / plexiglass
A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).
☆139Updated last year
mitre-atlas / ai-risk-database
Explore AI Supply Chain Risk with the AI Risk Database
☆59Updated last year
Seezo-io / llm-security-101
Delving into the Realm of LLM Security: An Exploration of Offensive and Defensive Tools, Unveiling Their Present Capabilities.
☆163Updated last year
dreadnode / rigging
Lightweight LLM Interaction Framework
☆296Updated this week
Giskard-AI / awesome-ai-safety
📚 A curated list of papers & technical articles on AI Quality & Safety
☆188Updated 3 months ago
xvnpw / fabric-agent-action
🤖 A GitHub action that leverages fabric patterns through an agent-based approach
☆28Updated 6 months ago