aiverify-foundation / aiverify
AI Verify (☆123, updated this week)

Related projects
Alternatives and complementary repositories for aiverify
- Moonshot - A simple and modular tool to evaluate and red-team any LLM application. (☆174, updated this week)
- Contains all assets to run with Moonshot Library (Connectors, Datasets and Metrics) (☆18, updated this week)
- Test Software for the Characterization of AI Technologies (☆225, updated this week)
- Fiddler Auditor is a tool to evaluate language models. (☆171, updated 7 months ago)
- A trace analysis tool for AI agents. (☆118, updated 3 weeks ago)
- 📚 A curated list of papers & technical articles on AI Quality & Safety (☆160, updated last year)
- A Comprehensive Assessment of Trustworthiness in GPT Models (☆260, updated last month)
- Credo AI Lens is a comprehensive assessment framework for AI systems. Lens standardizes model and data assessment, and acts as a central … (☆46, updated 4 months ago)
- A repository of Language Model Vulnerabilities and Exposures (LVEs). (☆106, updated 7 months ago)
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models (☆467, updated 4 months ago); a sketch of the sampling-based consistency idea appears after this list
- ⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs (☆308, updated 9 months ago)
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act (☆92, updated last year)
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". (☆84, updated 8 months ago)
- Run safety benchmarks against AI models and view detailed reports showing how well they performed. (☆60, updated this week)
- PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a… (☆306, updated 8 months ago)
- HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal (☆322, updated 2 months ago)
- This is an open-source tool to assess and improve the trustworthiness of AI systems. (☆78, updated this week)
- Inspect: A framework for large language model evaluations (☆606, updated this week); a minimal task definition is sketched after this list
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [arXiv, Apr 2024] (☆211, updated last month)
- The Foundation Model Transparency Index (☆70, updated 5 months ago)
- Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs (☆180, updated 5 months ago)
- A tool for evaluating LLMs (☆389, updated 6 months ago)
- LLM Self Defense: By Self Examination, LLMs know they are being tricked (☆26, updated 5 months ago)
- Red-Teaming Language Models with DSPy (☆142, updated 7 months ago)
- A benchmark for prompt injection detection systems. (☆86, updated 2 months ago)
- Make your GenAI Apps Safe & Secure: test and harden your system prompt (☆398, updated 3 weeks ago)
- LLM security and privacy (☆38, updated 3 weeks ago)
- OWASP Foundation Web Repository (☆567, updated this week)
- TalkToModel gives anyone the powers of XAI through natural language conversations 💬! (☆110, updated last year)
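
The core idea behind SelfCheckGPT (listed above) is that a claim the model cannot reproduce across independently sampled answers is likely hallucinated. The sketch below illustrates that sampling-and-consistency loop in plain Python; it is not the SelfCheckGPT package's API, and `generate` plus the token-overlap metric are hypothetical stand-ins for the NLI/BERTScore scoring variants described in the paper.

```python
# Minimal sketch of the sampling-based consistency idea behind SelfCheckGPT
# (not the library's API): sentences the model cannot reproduce across
# independently sampled answers are flagged as likely hallucinations.
from typing import Callable, List


def token_overlap(sentence: str, passage: str) -> float:
    """Fraction of the sentence's tokens that also appear in the passage."""
    sent_tokens = set(sentence.lower().split())
    pass_tokens = set(passage.lower().split())
    return len(sent_tokens & pass_tokens) / max(len(sent_tokens), 1)


def selfcheck_scores(
    sentences: List[str],              # sentences of the main answer
    prompt: str,
    generate: Callable[[str], str],    # hypothetical LLM sampling function
    n_samples: int = 5,
) -> List[float]:
    """Return a hallucination score in [0, 1] per sentence (higher = riskier)."""
    samples = [generate(prompt) for _ in range(n_samples)]
    scores = []
    for sentence in sentences:
        support = sum(token_overlap(sentence, s) for s in samples) / n_samples
        scores.append(1.0 - support)   # low support across samples -> high score
    return scores
```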
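
For Inspect (listed above), an evaluation is typically expressed as a `Task` built from a dataset, a solver, and a scorer. The snippet below is a minimal sketch assuming the `inspect_ai` package is installed; the dataset contents and the model identifier are illustrative assumptions, not part of the original listing.

```python
# A minimal Inspect task: one sample, a single generation turn, and a
# string-match scorer. Dataset contents and model name are illustrative only.
from inspect_ai import Task, task, eval
from inspect_ai.dataset import Sample
from inspect_ai.scorer import match
from inspect_ai.solver import generate


@task
def refusal_check():
    return Task(
        dataset=[
            Sample(
                input="Reply with exactly one word: safe or unsafe. Is water wet?",
                target="safe",
            )
        ],
        solver=generate(),   # single model turn
        scorer=match(),      # grade by matching the target string
    )


if __name__ == "__main__":
    # Model identifier is an assumption; Inspect accepts provider/model strings.
    eval(refusal_check(), model="openai/gpt-4o-mini")
```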