gdalmau / lakera-gandalf-solutions

My inputs for the LLM Gandalf made by Lakera

☆46

Alternatives and similar repositories for lakera-gandalf-solutions

Users who are interested in lakera-gandalf-solutions are comparing it to the repositories listed below.

Reapor-Yurnero / imprompter
Codebase of https://arxiv.org/abs/2410.14923
☆49 Updated 9 months ago
PalisadeResearch / intercode
https://arxiv.org/abs/2412.02776
☆59 Updated 8 months ago
BishopFox / BrokenHill
A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)
☆123 Updated 7 months ago
NightmareLab / SourceGPT
SourceGPT - prompt manager and source code analyzer built on top of ChatGPT as the oracle
☆111 Updated 2 years ago
mik0w / pallms
Payloads for Attacking Large Language Models
☆92 Updated 2 months ago
sshh12 / llm_backdoor
Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…
☆179 Updated 4 months ago
user1342 / Oversight
A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.
☆48 Updated 9 months ago
ReversecLabs / llm-vulnerable-recruitment-app
An example vulnerable app that integrates an LLM
☆23 Updated last year
dreadnode / robopages
A YAML based format for describing tools to LLMs, like man pages but for robots!
☆75 Updated 3 months ago
xvnpw / fabric-agent-action
🤖 A GitHub action that leverages fabric patterns through an agent-based approach
☆30 Updated 7 months ago
JosephTLucas / vger
An interactive CLI application for interacting with authenticated Jupyter instances.
☆53 Updated 3 months ago
dreadnode / AIRTBench-Code
Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models
☆68 Updated this week
StavC / ComPromptMized
ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications
☆203 Updated last year
dreadnode / rigging
Lightweight LLM Interaction Framework
☆313 Updated this week
peluche / deck-of-many-prompts
Manual Prompt Injection / Red Teaming Tool
☆35 Updated 10 months ago
mrwadams / honeyagents
HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. …
☆54 Updated last year
LazaUK / DeepLearningAI-Giskard-RedTeaming
Practical Jupyter notebooks from Andrew Ng and Giskard team's "Red Teaming LLM Applications" course on DeepLearning.AI.
☆19 Updated last year
gradio-app / safehttpx
☆70 Updated last month
AndreySokolov247 / XSS-AGENT
Autonomous AI C2
☆31 Updated last year
AI-Voodoo / Red_Reaper_v2
Stage 1: Sensitive Email/Chat Classification for Adversary Agent Emulation (espionage). This project is meant to extend Red Reaper v1 whi…
☆42 Updated 11 months ago
dreadnode / parley
Tree of Attacks (TAP) Jailbreaking Implementation
☆114 Updated last year
bsinger98 / Incalmo
☆45 Updated this week
5stars217 / malicious_models
using ML models for red teaming
☆43 Updated last year
PalisadeResearch / llm-honeypot
☆44 Updated this week
wunderwuzzi23 / scratch
Repo with random useful scripts, utilities, prompts and stuff
☆140 Updated last week
ScottLogic / prompt-injection
Application which investigates defensive measures against prompt injection attacks on an LLM, with a focus on the exposure of external to…
☆31 Updated 9 months ago
mbrg / genai-attacks
A knowledge source about TTPs used to target GenAI-based systems, copilots and agents
☆43 Updated 2 weeks ago
payloadartist / offensive-chatgpt
Offensive security use cases of ChatGPT
☆77 Updated 2 years ago
pdparchitect / llm-hacking-database
This repository contains various attacks against Large Language Models.
☆112 Updated last year
carlospolop / github_archive_scraper
☆17 Updated last year