microsoft / gandalf_vs_gandalf
Turning Gandalf against itself. Use LLMs to automate playing Lakera Gandalf challenge without needing to set up an account with a platform provider.
☆27Updated last year
Alternatives and similar repositories for gandalf_vs_gandalf:
Users that are interested in gandalf_vs_gandalf are comparing it to the libraries listed below
- HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. …☆39Updated last year
- Red-Teaming Language Models with DSPy☆153Updated 9 months ago
- [Corca / ML] Automatically solved Gandalf AI with LLM☆47Updated last year
- ☆27Updated last month
- Guard your LangChain applications against prompt injection with Lakera ChainGuard.☆18Updated this week
- A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).☆125Updated last year
- Make your GenAI Apps Safe & Secure Test & harden your system prompt☆429Updated 3 months ago
- A benchmark for prompt injection detection systems.☆94Updated 4 months ago
- ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications☆197Updated 10 months ago
- ☆67Updated 2 months ago
- Project LLM Verification Standard☆37Updated 9 months ago
- ☆15Updated last month
- Dropbox LLM Security research code and results☆219Updated 7 months ago
- Scripts, notebooks, and articles about data science in general.☆45Updated last year
- The project serves as a strategic advisory tool, capitalizing on the ZySec series of AI models to amplify the capabilities of security pr…☆43Updated 7 months ago
- source for llmsec.net☆13Updated 5 months ago
- ⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs☆339Updated 11 months ago
- A writeup for the Gandalf prompt injection game.☆36Updated last year
- Risks and targets for assessing LLMs & LLM vulnerabilities☆30Updated 7 months ago
- Project Mantis: Hacking Back the AI-Hacker; Prompt Injection as a Defense Against LLM-driven Cyberattacks☆59Updated last month
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆104Updated 4 months ago
- A script that will generate a fine-tuning file for openai's fine-tuning feature☆14Updated last year
- Ai power Dev using the rUv approach☆65Updated 2 months ago
- Lakera - ChatGPT Data Leak Protection☆22Updated 6 months ago
- AI-powered tool designed to help producing Threat Intelligence Mindmap.☆82Updated 2 weeks ago
- Curation of prompts that are known to be adversarial to large language models☆177Updated last year
- Approximation of the Claude 3 tokenizer by inspecting generation stream☆119Updated 5 months ago
- Security and compliance proxy for LLM APIs☆45Updated last year
- The application of multimodal RAG for Sustainable finance☆17Updated 5 months ago
- ☆26Updated 2 months ago