gdalmau / lakera-gandalf-solutionsLinks
My inputs for the LLM Gandalf made by Lakera
☆48Updated 2 years ago
Alternatives and similar repositories for lakera-gandalf-solutions
Users that are interested in lakera-gandalf-solutions are comparing it to the libraries listed below
Sorting:
- https://arxiv.org/abs/2412.02776☆67Updated last year
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)☆152Updated last year
- Tree of Attacks (TAP) Jailbreaking Implementation☆117Updated last year
- using ML models for red teaming☆45Updated 2 years ago
- Stage 1: Sensitive Email/Chat Classification for Adversary Agent Emulation (espionage). This project is meant to extend Red Reaper v1 whi…☆42Updated last year
- A YAML based format for describing tools to LLMs, like man pages but for robots!☆82Updated 8 months ago
- source code for the offsecml framework☆46Updated last year
- An interactive CLI application for interacting with authenticated Jupyter instances.☆55Updated 8 months ago
- Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models☆92Updated this week
- An example vulnerable app that integrates an LLM☆26Updated last year
- ☆20Updated 9 months ago
- Codebase of https://arxiv.org/abs/2410.14923☆52Updated last year
- LLM | Security | Operations in one github repo with good links and pictures.☆86Updated last week
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…☆201Updated 3 months ago
- Machine Learning Attack Series☆73Updated last year
- HoneyAgents is a PoC demo of an AI-driven system that combines honeypots with autonomous AI agents to detect and mitigate cyber threats. …☆58Updated 2 years ago
- Payloads for Attacking Large Language Models☆116Updated 7 months ago
- Repository for CoSAI Workstream 4, Secure Design Patterns for Agentic Systems☆45Updated last month
- A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.☆54Updated last year
- A utility to inspect, validate, sign and verify machine learning model files.☆63Updated 11 months ago
- A knowledge source about TTPs used to target GenAI-based systems, copilots and agents☆132Updated 2 weeks ago
- An AI-driven MCP server that autonomously interfaces with Malware Bazaar, delivering real-time threat intel and sample metadata for autho…☆25Updated last month
- All things specific to LLM Red Teaming Generative AI☆29Updated last year
- CLI and API server for https://github.com/dreadnode/robopages☆38Updated last week
- Example agents for the Dreadnode platform☆22Updated 3 weeks ago
- ☆10Updated last year
- A writeup for the Gandalf prompt injection game.☆39Updated 2 years ago
- ☆126Updated 3 weeks ago
- ATHF is a framework for agentic threat hunting - building systems that can remember, learn, and act with increasing autonomy.☆152Updated this week
- Autonomous AI C2☆33Updated last year