microsoft / gandalf_vs_gandalfLinks
Turning Gandalf against itself. Use LLMs to automate playing Lakera Gandalf challenge without needing to set up an account with a platform provider.
☆28Updated last year
Alternatives and similar repositories for gandalf_vs_gandalf
Users that are interested in gandalf_vs_gandalf are comparing it to the libraries listed below
Sorting:
- ☆56Updated 5 months ago
- HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. …☆57Updated last year
- Dropbox LLM Security research code and results☆235Updated last year
- A benchmark for prompt injection detection systems.☆142Updated last month
- Test Software for the Characterization of AI Technologies☆261Updated this week
- Make your GenAI Apps Safe & Secure Test & harden your system prompt☆572Updated 2 weeks ago
- Here Comes the AI Worm: Preventing the Propagation of Adversarial Self-Replicating Prompts Within GenAI Ecosystems☆211Updated last month
- ATLAS tactics, techniques, and case studies data☆80Updated last week
- Red-Teaming Language Models with DSPy☆216Updated 7 months ago
- The project serves as a strategic advisory tool, capitalizing on the ZySec series of AI models to amplify the capabilities of security pr…☆63Updated last year
- A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).☆148Updated last year
- ⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs☆417Updated last year
- [Corca / ML] Automatically solved Gandalf AI with LLM☆51Updated 2 years ago
- Codebase of https://arxiv.org/abs/2410.14923☆51Updated 11 months ago
- OWASP Machine Learning Security Top 10 Project☆92Updated 8 months ago
- ☆38Updated 9 months ago
- Curated list of Open Source project focused on LLM security☆62Updated 11 months ago
- Agentic Workflows Made Simple☆158Updated 6 months ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆112Updated last year
- Explore AI Supply Chain Risk with the AI Risk Database☆62Updated last year
- Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models☆81Updated this week
- Secure Jupyter Notebooks and Experimentation Environment☆84Updated 8 months ago
- The fastest Trust Layer for AI Agents☆143Updated 4 months ago
- Lightweight LLM Interaction Framework☆381Updated this week
- ☆76Updated this week
- Prompt Injection Primer for Engineers☆460Updated 2 years ago
- ☆260Updated last month
- Project Mantis: Hacking Back the AI-Hacker; Prompt Injection as a Defense Against LLM-driven Cyberattacks☆86Updated 4 months ago
- A guide to LLM hacking: fundamentals, prompt injection, offense, and defense☆172Updated 2 years ago
- A powerful tool that leverages AI to automatically generate comprehensive security documentation for your projects☆92Updated last month