gdalmau / lakera-gandalf-solutionsLinks
My inputs for the LLM Gandalf made by Lakera
☆46Updated 2 years ago
Alternatives and similar repositories for lakera-gandalf-solutions
Users that are interested in lakera-gandalf-solutions are comparing it to the libraries listed below
Sorting:
- LLM | Security | Operations in one github repo with good links and pictures.☆58Updated 9 months ago
- An example vulnerable app that integrates an LLM☆23Updated last year
- Payloads for Attacking Large Language Models☆102Updated 4 months ago
- A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.☆51Updated 11 months ago
- https://arxiv.org/abs/2412.02776☆62Updated 10 months ago
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)☆140Updated 9 months ago
- An interactive CLI application for interacting with authenticated Jupyter instances.☆55Updated 5 months ago
- AI cybersecurity agent for automated penetration testing and vulnerability assessment☆85Updated last week
- Tree of Attacks (TAP) Jailbreaking Implementation☆114Updated last year
- using ML models for red teaming☆44Updated 2 years ago
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…☆186Updated this week
- Codebase of https://arxiv.org/abs/2410.14923☆51Updated 11 months ago
- ☆69Updated 3 months ago
- 🤖 A GitHub action that leverages fabric patterns through an agent-based approach☆32Updated 9 months ago
- Machine Learning Attack Series☆69Updated last year
- ☆58Updated last week
- HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. …☆57Updated last year
- A writeup for the Gandalf prompt injection game.☆37Updated 2 years ago
- source code for the offsecml framework☆42Updated last year
- Powerful LLM Query Framework with YAML Prompt Templates. Made for Automation☆33Updated 2 weeks ago
- ☆44Updated last week
- A YAML based format for describing tools to LLMs, like man pages but for robots!☆77Updated 5 months ago
- Static code analyser for backdoors and malicious code in git repos using OpenAI compatible LLM APIs☆73Updated last year
- SourceGPT - prompt manager and source code analyzer built on top of ChatGPT as the oracle☆111Updated 2 years ago
- ☆14Updated last year
- Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models☆81Updated this week
- A utility to inspect, validate, sign and verify machine learning model files.☆58Updated 8 months ago
- Application which investigates defensive measures against prompt injection attacks on an LLM, with a focus on the exposure of external to…☆32Updated 11 months ago
- ☆17Updated 6 months ago
- Manual Prompt Injection / Red Teaming Tool☆42Updated last year