wunderwuzzi23 / mlattacks
Machine Learning Attack Series
☆68 · Updated last year
Alternatives and similar repositories for mlattacks
Users interested in mlattacks are comparing it to the libraries listed below.
- An interactive CLI application for interacting with authenticated Jupyter instances. ☆54 · Updated 3 months ago
- A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs). ☆142 · Updated last year
- Codebase of https://arxiv.org/abs/2410.14923 ☆50 · Updated 10 months ago
- Central repo for talks and presentations ☆46 · Updated last year
- A utility to inspect, validate, sign and verify machine learning model files. ☆58 · Updated 6 months ago
- A JupyterLab extension to evaluate the security of your Jupyter environment ☆39 · Updated 2 years ago
- Use LLMs for document ranking ☆145 · Updated 4 months ago
- Lightweight LLM Interaction Framework ☆367 · Updated this week
- Awesome products for securing AI systems; includes open source and commercial options and an infographic licensed CC-BY-SA-4.0. ☆67 · Updated last year
- Tree of Attacks (TAP) Jailbreaking Implementation ☆115 · Updated last year
- Dropbox LLM Security research code and results ☆233 · Updated last year
- High-signal information security sources Goggle. ☆67 · Updated 2 years ago
- A YAML-based format for describing tools to LLMs, like man pages but for robots! ☆78 · Updated 3 months ago
- ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications ☆203 · Updated last year
- An environment for testing AI agents against networks using Metasploit. ☆45 · Updated 2 years ago
- Code for the paper "Defeating Prompt Injections by Design" ☆94 · Updated 2 months ago
- My inputs for the LLM Gandalf made by Lakera ☆47 · Updated last year
- Source code for the offsecml framework ☆41 · Updated last year
- Data Scientists Go To Jupyter ☆65 · Updated 5 months ago
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote… ☆183 · Updated 4 months ago
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs) ☆131 · Updated 8 months ago
- ☆70 · Updated 2 months ago
- Secure Jupyter Notebooks and Experimentation Environment ☆80 · Updated 6 months ago
- Payloads for Attacking Large Language Models ☆96 · Updated 2 months ago
- Project LLM Verification Standard ☆48 · Updated 3 months ago
- ☆145 · Updated 3 months ago
- ☆65 · Updated this week
- The Privacy Adversarial Framework (PAF) is a knowledge base of privacy-focused adversarial tactics and techniques. PAF is heavily inspire… ☆58 · Updated 2 years ago
- Using ML models for red teaming ☆44 · Updated 2 years ago
- ☆52 · Updated 2 weeks ago