Giskard-AI / awesome-ai-safety
A curated list of papers & technical articles on AI Quality & Safety
★172 · Updated last year
Alternatives and similar repositories for awesome-ai-safety:
Users interested in awesome-ai-safety are comparing it to the libraries listed below.
- Fiddler Auditor is a tool to evaluate language models. ★178 · Updated last year
- Creating the tools and data sets necessary to evaluate vulnerabilities in LLMs. ★23 · Updated 2 weeks ago
- ★263 · Updated 2 months ago
- Datasets and models for instruction-tuning ★238 · Updated last year
- Open Implementations of LLM Analyses ★103 · Updated 5 months ago
- ★42 · Updated 7 months ago
- A tool for evaluating LLMs ★410 · Updated 10 months ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act ★93 · Updated last year
- Red-Teaming Language Models with DSPy ★175 · Updated last month
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". ★98 · Updated last year
- Keeping language models honest by directly eliciting knowledge encoded in their activations. ★197 · Updated last week
- The Foundation Model Transparency Index ★77 · Updated 10 months ago
- Run safety benchmarks against AI models and view detailed reports showing how well they performed. ★83 · Updated this week
- Data cleaning and curation for unstructured text ★329 · Updated 7 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses. ★72 · Updated last week
- ★221 · Updated this week
- Erasing concepts from neural representations with provable guarantees ★226 · Updated 2 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ★102 · Updated this week
- A joint community effort to create one central leaderboard for LLMs. ★294 · Updated 7 months ago
- Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs ★230 · Updated 9 months ago
- A framework-less approach to robust agent development. ★156 · Updated this week
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ★504 · Updated 9 months ago
- A framework to empower forecasting using Large Language Models (LLMs) ★105 · Updated 8 months ago
- Collection of evals for Inspect AI ★101 · Updated this week
- A curated list of awesome resources for Artificial Intelligence Alignment research ★69 · Updated last year
- PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a… ★349 · Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ★102 · Updated 3 months ago
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022. ★307 · Updated 9 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ★230 · Updated last month
- Fast & more realistic evaluation of chat language models. Includes leaderboard. ★185 · Updated last year