gdalmau / lakera-gandalf-solutionsLinks
My inputs for the LLM Gandalf made by Lakera
☆46Updated last year
Alternatives and similar repositories for lakera-gandalf-solutions
Users that are interested in lakera-gandalf-solutions are comparing it to the libraries listed below
Sorting:
- Codebase of https://arxiv.org/abs/2410.14923☆49Updated 9 months ago
- https://arxiv.org/abs/2412.02776☆59Updated 8 months ago
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)☆123Updated 7 months ago
- SourceGPT - prompt manager and source code analyzer built on top of ChatGPT as the oracle☆111Updated 2 years ago
- Payloads for Attacking Large Language Models☆92Updated 2 months ago
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…☆179Updated 4 months ago
- A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.☆48Updated 9 months ago
- An example vulnerable app that integrates an LLM☆23Updated last year
- A YAML based format for describing tools to LLMs, like man pages but for robots!☆75Updated 3 months ago
- 🤖 A GitHub action that leverages fabric patterns through an agent-based approach☆30Updated 7 months ago
- An interactive CLI application for interacting with authenticated Jupyter instances.☆53Updated 3 months ago
- Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models☆68Updated this week
- ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications☆203Updated last year
- Lightweight LLM Interaction Framework☆313Updated this week
- Manual Prompt Injection / Red Teaming Tool☆35Updated 10 months ago
- HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. …☆54Updated last year
- Practical Jupyter notebooks from Andrew Ng and Giskard team's "Red Teaming LLM Applications" course on DeepLearning.AI.☆19Updated last year
- ☆70Updated last month
- Autonomous AI C2☆31Updated last year
- Stage 1: Sensitive Email/Chat Classification for Adversary Agent Emulation (espionage). This project is meant to extend Red Reaper v1 whi…☆42Updated 11 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation☆114Updated last year
- ☆45Updated this week
- using ML models for red teaming☆43Updated last year
- ☆44Updated this week
- Repo with random useful scripts, utilities, prompts and stuff☆140Updated last week
- Application which investigates defensive measures against prompt injection attacks on an LLM, with a focus on the exposure of external to…☆31Updated 9 months ago
- A knowledge source about TTPs used to target GenAI-based systems, copilots and agents☆43Updated 2 weeks ago
- Offensive security use cases of ChatGPT☆77Updated 2 years ago
- This repository contains various attack against Large Language Models.☆112Updated last year
- ☆17Updated last year