gdalmau / lakera-gandalf-solutionsLinks
My inputs for the LLM Gandalf made by Lakera
☆47Updated 2 years ago
Alternatives and similar repositories for lakera-gandalf-solutions
Users that are interested in lakera-gandalf-solutions are comparing it to the libraries listed below
Sorting:
- An example vulnerable app that integrates an LLM☆24Updated last year
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)☆134Updated 9 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation☆115Updated last year
- An interactive CLI application for interacting with authenticated Jupyter instances.☆55Updated 4 months ago
- A YAML based format for describing tools to LLMs, like man pages but for robots!☆78Updated 4 months ago
- Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models☆77Updated this week
- Here Comes the AI Worm: Preventing the Propagation of Adversarial Self-Replicating Prompts Within GenAI Ecosystems☆205Updated last week
- Payloads for Attacking Large Language Models☆99Updated 3 months ago
- Lightweight LLM Interaction Framework☆375Updated this week
- LLM | Security | Operations in one github repo with good links and pictures.☆55Updated 8 months ago
- HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. …☆56Updated last year
- using ML models for red teaming☆44Updated 2 years ago
- Example agents for the Dreadnode platform☆16Updated last month
- Manual Prompt Injection / Red Teaming Tool☆37Updated 11 months ago
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…☆185Updated 5 months ago
- Stage 1: Sensitive Email/Chat Classification for Adversary Agent Emulation (espionage). This project is meant to extend Red Reaper v1 whi …☆42Updated last year
- A utility to inspect, validate, sign and verify machine learning model files.☆58Updated 7 months ago
- A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.☆50Updated 10 months ago
- https://arxiv.org/abs/2412.02776☆62Updated 9 months ago
- This repository contains various attack against Large Language Models.☆114Updated last year
- A knowledge source about TTPs used to target GenAI-based systems, copilots and agents☆120Updated last month
- ☆69Updated 3 months ago
- Delving into the Realm of LLM Security: An Exploration of Offensive and Defensive Tools, Unveiling Their Present Capabilities.☆164Updated last year
- ☆17Updated 5 months ago
- Practical Jupyter notebooks from Andrew Ng and Giskard team's "Red Teaming LLM Applications" course on DeepLearning.AI.☆19Updated last year
- ☆54Updated this week
- Codebase of https://arxiv.org/abs/2410.14923☆50Updated 10 months ago
- 🤖 A GitHub action that leverages fabric patterns through an agent-based approach☆32Updated 8 months ago
- SourceGPT - prompt manager and source code analyzer built on top of ChatGPT as the oracle☆111Updated 2 years ago
- A writeup for the Gandalf prompt injection game.☆37Updated 2 years ago