humane-intelligence / ai_village_defcon_grt_dataLinks

☆14

Alternatives and similar repositories for ai_village_defcon_grt_data

Users that are interested in ai_village_defcon_grt_data are comparing it to the libraries listed below

Sorting:

dreadnode / research
General research for Dreadnode
☆25Updated last year
pasquini-dario / LLMmap
☆84Updated 3 months ago
lve-org / lve
A repository of Language Model Vulnerabilities and Exposures (LVEs).
☆112Updated last year
NickNameInvalid / LLM_CTF
☆65Updated 2 months ago
xsankar / AI-Red-Teaming
All things specific to LLM Red Teaming Generative AI
☆29Updated last year
sunblaze-ucb / cybergym
CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on…
☆94Updated last month
BishopFox / BrokenHill
A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)
☆145Updated 11 months ago
mik0w / pallms
Payloads for Attacking Large Language Models
☆104Updated 5 months ago
google-research / camel-prompt-injection
Code for the paper "Defeating Prompt Injections by Design"
☆150Updated 5 months ago
Valhall-ai / prompt-injection-mitigations
A collection of prompt injection mitigation techniques.
☆24Updated 2 years ago
vinusankars / BEAST
Implementation of BEAST adversarial attack for language models (ICML 2024)
☆91Updated last year
moohax / Charcuterie
Data Scientists Go To Jupyter
☆67Updated 8 months ago
ReversecLabs / spikee
☆107Updated last week
dreadnode / parley
Tree of Attacks (TAP) Jailbreaking Implementation
☆115Updated last year
LLMSecurity / HouYi
The automated prompt injection framework for LLM-integrated applications.
☆237Updated last year
dreadnode / example-agents
Example agents for the Dreadnode platform
☆19Updated this week
invariantlabs-ai / mcp-injection-experiments
Code snippets to reproduce MCP tool poisoning attacks.
☆187Updated 7 months ago
lakeraai / pint-benchmark
A benchmark for prompt injection detection systems.
☆150Updated 2 months ago
Libr-AI / OpenRedTeaming
Papers about red teaming LLMs and Multimodal models.
☆154Updated 5 months ago
AIM-Intelligence / Automated-Multi-Turn-Jailbreaks
☆98Updated last year
microsoft / BIPIA
A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.
☆88Updated last year
timothee-chauvin / eyeballvul
future-proof vulnerability detection benchmark, based on CVEs in open-source repos
☆60Updated last week
PalisadeResearch / intercode
https://arxiv.org/abs/2412.02776
☆66Updated 11 months ago
ethz-spylab / agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆348Updated 3 weeks ago
leondz / lm_risk_cards
Risks and targets for assessing LLMs & LLM vulnerabilities
☆32Updated last year
briland / LLM-security-and-privacy
LLM security and privacy
☆51Updated last year
NYU-LLM-CTF / nyuctf_agents
The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench
☆102Updated 3 weeks ago
liu00222 / Open-Prompt-Injection
This repository provides a benchmark for prompt injection attacks and defenses
☆343Updated 3 weeks ago
5stars217 / offsecml
source code for the offsecml framework
☆43Updated last year
mnns / LLMFuzzer
🧠 LLMFuzzer - Fuzzing Framework for Large Language Models 🧠 LLMFuzzer is the first open-source fuzzing framework specifically designed …
☆327Updated last year