wunderwuzzi23 / mlattacksLinks
Machine Learning Attack Series
☆72Updated last year
Alternatives and similar repositories for mlattacks
Users that are interested in mlattacks are comparing it to the libraries listed below
Sorting:
- A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).☆151Updated last year
- Codebase of https://arxiv.org/abs/2410.14923☆52Updated last year
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…☆193Updated 2 months ago
- Awesome products for securing AI systems includes open source and commercial options and an infographic licensed CC-BY-SA-4.0.☆80Updated last year
- A utility to inspect, validate, sign and verify machine learning model files.☆61Updated 10 months ago
- Here Comes the AI Worm: Preventing the Propagation of Adversarial Self-Replicating Prompts Within GenAI Ecosystems☆220Updated 3 months ago
- Code for the paper "Defeating Prompt Injections by Design"☆179Updated 6 months ago
- Dropbox LLM Security research code and results☆250Updated last year
- An interactive CLI application for interacting with authenticated Jupyter instances.☆54Updated 7 months ago
- Use LLMs for document ranking☆160Updated 8 months ago
- ☆153Updated 3 months ago
- Lightweight LLM Interaction Framework☆397Updated last week
- My inputs for the LLM Gandalf made by Lakera☆48Updated 2 years ago
- Project Mantis: Hacking Back the AI-Hacker; Prompt Injection as a Defense Against LLM-driven Cyberattacks☆92Updated 6 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation☆116Updated last year
- A JupyterLab extension to evaluate the security of your Jupyter environment☆39Updated 2 years ago
- A YAML based format for describing tools to LLMs, like man pages but for robots!☆82Updated 7 months ago
- Multi-agent system (MAS) hijacking demos☆39Updated 2 weeks ago
- ☆50Updated last week
- Repository for CoSAI Workstream 4, Secure Design Patterns for Agentic Systems☆42Updated last week
- A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.☆52Updated last year
- ATHI — An AI Threat Modeling Framework for Policymakers☆58Updated 2 years ago
- Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models☆90Updated this week
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆112Updated last year
- Payloads for Attacking Large Language Models☆114Updated 6 months ago
- BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps☆63Updated 2 years ago
- Project LLM Verification Standard☆51Updated 2 months ago
- An open source investigation tool to collect and analyse public VK community wall posts☆35Updated 3 years ago
- ☆70Updated last month
- Secure Jupyter Notebooks and Experimentation Environment☆84Updated 10 months ago