wunderwuzzi23 / mlattacks

Machine Learning Attack Series

☆69

Alternatives and similar repositories for mlattacks

Users who are interested in mlattacks are comparing it to the repositories listed below.

Reapor-Yurnero / imprompter
Codebase of https://arxiv.org/abs/2410.14923
☆51 Updated last year
dropbox / llm-security
Dropbox LLM Security research code and results
☆237 Updated last year
JosephTLucas / vger
An interactive CLI application for interacting with authenticated Jupyter instances.
☆55 Updated 5 months ago
safellama / plexiglass
A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).
☆150 Updated last year
trailofbits / awesome-ml-security
☆151 Updated last month
gdalmau / lakera-gandalf-solutions
My inputs for the LLM Gandalf made by Lakera
☆46 Updated 2 years ago
5stars217 / offsecml
source code for the offsecml framework
☆42 Updated last year
BishopFox / raink
Use LLMs for document ranking
☆151 Updated 6 months ago
sshh12 / llm_backdoor
Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…
☆186 Updated 3 weeks ago
dreadnode / parley
Tree of Attacks (TAP) Jailbreaking Implementation
☆114 Updated last year
trailofbits / pajaMAS
Multi-agent system (MAS) hijacking demos
☆37 Updated 3 weeks ago
zmre / awesome-security-for-ai
Awesome products for securing AI systems, including open source and commercial options, plus an infographic licensed CC-BY-SA-4.0.
☆73 Updated last year
5stars217 / malicious_models
using ML models for red teaming
☆44 Updated 2 years ago
mik0w / pallms
Payloads for Attacking Large Language Models
☆104 Updated 4 months ago
dreadnode / tensor-man
A utility to inspect, validate, sign and verify machine learning model files.
☆59 Updated 8 months ago
JosephTLucas / lintML
A security-first linter for code that shouldn't need linting
☆16 Updated 2 years ago
lve-org / lve
A repository of Language Model Vulnerabilities and Exposures (LVEs).
☆112 Updated last year
phreakAI / metasploit-gym
An environment for testing AI agents against networks using Metasploit.
☆45 Updated 2 years ago
google-research / camel-prompt-injection
Code for the paper "Defeating Prompt Injections by Design"
☆138 Updated 4 months ago
dreadnode / AIRTBench-Code
Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models
☆83 Updated last week
JosephTLucas / jupysec
A JupyterLab extension to evaluate the security of your Jupyter environment
☆39 Updated 2 years ago
protectai / nbdefense
Secure Jupyter Notebooks and Experimentation Environment
☆84 Updated 8 months ago
moohax / Charcuterie
Data Scientists Go To Jupyter
☆67 Updated 7 months ago
dreadnode / robopages
A YAML based format for describing tools to LLMs, like man pages but for robots!
☆78 Updated 5 months ago
user1342 / Oversight
A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.
☆52 Updated 11 months ago
BishopFox / BrokenHill
A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)
☆142 Updated 10 months ago
facebookresearch / privacy_adversarial_framework
The Privacy Adversarial Framework (PAF) is a knowledge base of privacy-focused adversarial tactics and techniques. PAF is heavily inspire…
☆59 Updated 2 years ago
ReversecLabs / llm-vulnerable-recruitment-app
An example vulnerable app that integrates an LLM
☆24 Updated last year
moohax / Talks
Central repo for talks and presentations
☆46 Updated last year
pasquini-dario / project_mantis
Project Mantis: Hacking Back the AI-Hacker; Prompt Injection as a Defense Against LLM-driven Cyberattacks
☆88 Updated 5 months ago