wunderwuzzi23 / mlattacks
Machine Learning Attack Series
☆73 Updated last year
Alternatives and similar repositories for mlattacks
Users who are interested in mlattacks are comparing it to the repositories listed below
- Codebase of https://arxiv.org/abs/2410.14923 ☆52 Updated last year
- Dropbox LLM Security research code and results ☆250 Updated last year
- My inputs for the LLM Gandalf made by Lakera ☆48 Updated 2 years ago
- A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs). ☆151 Updated 2 years ago
- A JupyterLab extension to evaluate the security of your Jupyter environment ☆39 Updated 2 years ago
- Awesome products for securing AI systems includes open source and commercial options and an infographic licensed CC-BY-SA-4.0. ☆80 Updated last year
- ☆154 Updated 4 months ago
- A utility to inspect, validate, sign and verify machine learning model files. ☆63 Updated 11 months ago
- Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models ☆92 Updated this week
- A YAML based format for describing tools to LLMs, like man pages but for robots! ☆82 Updated 8 months ago
- Payloads for Attacking Large Language Models ☆116 Updated 7 months ago
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote… ☆201 Updated 3 months ago
- source code for the offsecml framework ☆46 Updated last year
- Lightweight LLM Interaction Framework ☆402 Updated this week
- Use LLMs for document ranking ☆160 Updated 8 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation ☆117 Updated last year
- Multi-agent system (MAS) hijacking demos ☆39 Updated this week
- An interactive CLI application for interacting with authenticated Jupyter instances. ☆55 Updated 8 months ago
- Data Scientists Go To Jupyter ☆68 Updated 10 months ago
- Central repo for talks and presentations ☆47 Updated last year
- ☆44 Updated last year
- An environment for testing AI agents against networks using Metasploit. ☆45 Updated 2 years ago
- Repository for CoSAI Workstream 4, Secure Design Patterns for Agentic Systems ☆45 Updated last month
- The Privacy Adversarial Framework (PAF) is a knowledge base of privacy-focused adversarial tactics and techniques. PAF is heavily inspire… ☆57 Updated 2 years ago
- Project Mantis: Hacking Back the AI-Hacker; Prompt Injection as a Defense Against LLM-driven Cyberattacks ☆93 Updated 7 months ago
- LobotoMl is a set of scripts and tools to assess production deployments of ML services ☆10 Updated 3 years ago
- This repository is for administrative documents for the CoSAI OASIS Open Project ☆70 Updated this week
- RedFlag uses AI to identify high-risk code changes. Run it in batch mode for release candidate testing or in CI pipelines to flag PRs and… ☆158 Updated last year
- BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps ☆63 Updated 2 years ago
- Code for the paper "Defeating Prompt Injections by Design" ☆205 Updated 6 months ago