amazon-science / TurboFuzzLLMLinks
TurboFuzzLLM: Turbocharging Mutation-based Fuzzing for Effectively Jailbreaking Large Language Models in Practice
☆22Updated 2 months ago
Alternatives and similar repositories for TurboFuzzLLM
Users that are interested in TurboFuzzLLM are comparing it to the libraries listed below
Sorting:
- ☆190Updated last month
- ☆34Updated last year
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆112Updated last year
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…☆79Updated 5 months ago
- A utility to inspect, validate, sign and verify machine learning model files.☆65Updated last year
- A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.☆103Updated last year
- ☆133Updated 6 months ago
- Cyber-Zero: Training Cybersecurity Agents Without Runtime☆69Updated last week
- LLM security and privacy☆54Updated last year
- An environment simulation for networks security tasks for development and testing AI based agents. Part of AI Dojo project☆57Updated 2 weeks ago
- Implementation of BEAST adversarial attack for language models (ICML 2024)☆91Updated last year
- General research for Dreadnode☆27Updated last year
- Code for the paper "Defeating Prompt Injections by Design"☆232Updated 7 months ago
- This project investigates the security of large language models by performing binary classification of a set of input prompts to discover…☆56Updated 2 years ago
- ☆23Updated 2 years ago
- LLM proxy to observe and debug what your AI agents are doing.☆63Updated 3 months ago
- SECURE: Benchmarking Generative Large Language Models as a Cyber Advisory☆15Updated last year
- Do you want to learn AI Security but don't know where to start ? Take a look at this map.☆29Updated last year
- CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on…☆108Updated 3 weeks ago
- An Execution Isolation Architecture for LLM-Based Agentic Systems☆103Updated last year
- ☆81Updated 3 months ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆425Updated this week
- A collection of prompt injection mitigation techniques.☆27Updated 2 years ago
- This dataset contains results from all rounds of Adversarial Nibbler. This data includes adversarial prompts fed into public generative t…☆25Updated last year
- The official repository of the paper "On the Exploitability of Instruction Tuning".☆69Updated 2 years ago
- ☆18Updated last year
- ☆156Updated 5 months ago
- PAL: Proxy-Guided Black-Box Attack on Large Language Models☆57Updated last year
- A library for red-teaming LLM applications with LLMs.☆29Updated last year
- Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"☆54Updated last year