lakeraai / canica
A text embedding viewer for the Jupyter environment
☆19 · Updated last year
Alternatives and similar repositories for canica:
Users interested in canica are comparing it to the libraries listed below.
- A benchmark for prompt injection detection systems. ☆96 · Updated 2 weeks ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs). ☆108 · Updated 11 months ago
- [Corca / ML] Automatically solving Gandalf AI with an LLM ☆48 · Updated last year
- ☆43 · Updated 2 years ago
- Risks and targets for assessing LLMs & LLM vulnerabilities ☆30 · Updated 8 months ago
- ☆40 · Updated 6 months ago
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024. ☆109 · Updated 8 months ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents. ☆96 · Updated this week
- PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a… ☆333 · Updated 11 months ago
- Turning Gandalf against itself. Use LLMs to automate playing the Lakera Gandalf challenge without needing to set up an account with a platfor… ☆29 · Updated last year
- Dropbox LLM Security research code and results ☆220 · Updated 9 months ago
- Project LLM Verification Standard ☆38 · Updated 10 months ago
- ATLAS tactics, techniques, and case studies data ☆56 · Updated 4 months ago
- A framework-less approach to robust agent development. ☆154 · Updated this week
- Implementation of BEAST adversarial attack for language models (ICML 2024) ☆79 · Updated 9 months ago
- Code to break Llama Guard ☆31 · Updated last year
- AI Verify ☆137 · Updated this week
- The repository contains the code for analysing the leakage of personally identifiable information (PII) from the output of next word pred… ☆88 · Updated 6 months ago
- LLM security and privacy ☆47 · Updated 4 months ago
- ⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs ☆350 · Updated last year
- ☆34 · Updated 3 months ago
- ☆13 · Updated 8 months ago
- Payloads for Attacking Large Language Models ☆74 · Updated 7 months ago
- Dataset for the Tensor Trust project ☆36 · Updated 11 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation ☆99 · Updated last year
- This repository provides an implementation to formalize and benchmark Prompt Injection attacks and defenses ☆172 · Updated last month
- Code used to run the platform for the LLM CTF colocated with SaTML 2024 ☆26 · Updated 11 months ago
- Run safety benchmarks against AI models and view detailed reports showing how well they performed. ☆79 · Updated this week
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a… ☆43 · Updated 2 months ago
- Adversarial Attacks on GPT-4 via Simple Random Search [Dec 2023] ☆43 · Updated 9 months ago