LazaUK / DeepLearningAI-Giskard-RedTeamingLinks
Practical Jupyter notebooks from Andrew Ng and Giskard team's "Red Teaming LLM Applications" course on DeepLearning.AI.
β22Updated last year
Alternatives and similar repositories for DeepLearningAI-Giskard-RedTeaming
Users that are interested in DeepLearningAI-Giskard-RedTeaming are comparing it to the libraries listed below
Sorting:
- The fastest Trust Layer for AI Agentsβ146Updated 6 months ago
- π€π‘οΈπππ Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.β27Updated last year
- Secure Jupyter Notebooks and Experimentation Environmentβ84Updated 10 months ago
- β98Updated 4 months ago
- A prompt defence is a multi-layer defence that can be used to protect your applications against prompt injection attacks.β20Updated last week
- Risks and targets for assessing LLMs & LLM vulnerabilitiesβ33Updated last year
- using ML models for red teamingβ45Updated 2 years ago
- Payloads for Attacking Large Language Modelsβ114Updated 6 months ago
- My inputs for the LLM Gandalf made by Lakeraβ48Updated 2 years ago
- LLM | Security | Operations in one github repo with good links and pictures.β71Updated last week
- Framework for LLM evaluation, guardrails and securityβ114Updated last year
- All things specific to LLM Red Teaming Generative AIβ29Updated last year
- LLM security and privacyβ52Updated last year
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injectionβ¦β45Updated 9 months ago
- Repository for CoSAI Workstream 4, Secure Design Patterns for Agentic Systemsβ42Updated last week
- Code snippets to reproduce MCP tool poisoning attacks.β187Updated 8 months ago
- Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Modelsβ90Updated this week
- https://arxiv.org/abs/2412.02776β66Updated last year
- A collection of prompt injection mitigation techniques.β25Updated 2 years ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs).β112Updated last year
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)β150Updated last year
- Dropbox LLM Security research code and resultsβ250Updated last year
- MCPSafetyScanner - Automated MCP safety auditing and remediation using Agents. More info: https://www.arxiv.org/abs/2504.03767β157Updated 8 months ago
- Rapidly identify and mitigate container security vulnerabilities with generative AI.β182Updated last week
- β66Updated 3 months ago
- Project Mantis: Hacking Back the AI-Hacker; Prompt Injection as a Defense Against LLM-driven Cyberattacksβ92Updated 6 months ago
- Codebase of https://arxiv.org/abs/2410.14923β52Updated last year
- β29Updated 6 months ago
- Code for the paper "Defeating Prompt Injections by Design"β179Updated 6 months ago
- Top 10 for Agentic AI (AI Agent Security) serves as the core for OWASP and CSA Red teaming workβ157Updated 2 months ago