humane-intelligence / ai_village_defcon_grt_data
☆13Updated 9 months ago
Alternatives and similar repositories for ai_village_defcon_grt_data:
Users that are interested in ai_village_defcon_grt_data are comparing it to the libraries listed below
- General research for Dreadnode☆19Updated 8 months ago
- Data Scientists Go To Jupyter☆62Updated last week
- All things specific to LLM Red Teaming Generative AI☆23Updated 4 months ago
- ☆64Updated last month
- A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)☆89Updated 2 months ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆108Updated last year
- source code for the offsecml framework☆38Updated 9 months ago
- The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench☆54Updated 2 weeks ago
- A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.☆60Updated 10 months ago
- Payloads for Attacking Large Language Models☆75Updated 8 months ago
- Implementation of BEAST adversarial attack for language models (ICML 2024)☆80Updated 9 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation☆102Updated last year
- ☆85Updated last week
- Codebase of https://arxiv.org/abs/2410.14923☆44Updated 4 months ago
- The automated prompt injection framework for LLM-integrated applications.☆186Updated 6 months ago
- The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".☆42Updated 4 months ago
- ATLAS tactics, techniques, and case studies data☆57Updated 5 months ago
- Integrate PyRIT in existing tools☆13Updated last week
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…☆48Updated this week
- A utility to inspect, validate, sign and verify machine learning model files.☆53Updated last month
- A YAML based format for describing tools to LLMs, like man pages but for robots!☆62Updated last month
- An interactive CLI application for interacting with authenticated Jupyter instances.☆50Updated 11 months ago
- A benchmark for prompt injection detection systems.☆98Updated last month
- [ICML 2024] COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability☆138Updated 2 months ago
- ☆16Updated 10 months ago
- ☆54Updated 8 months ago
- AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks☆38Updated 9 months ago
- ☆16Updated 9 months ago
- ☆42Updated 7 months ago