AmenRa/GuardBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AmenRa/GuardBench)

AmenRa / GuardBench

A Python library for guardrail models evaluation.

☆37

Alternatives and similar repositories for GuardBench

Users that are interested in GuardBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AmenRa / a-multi-domain-benchmark-for-personalized-search-evaluation
View on GitHub
A Multi-domain Benchmark for Personalized Search Evaluation
☆12Sep 7, 2023Updated 2 years ago
ebegoli / SIGIR2022-Efficient-Transfomers
View on GitHub
☆21Jul 11, 2022Updated 4 years ago
sajastu / reddit_collector
View on GitHub
Reddit Collector and Text Processor
☆24Sep 7, 2022Updated 3 years ago
CrowdStrike / CyberSOCEval_data
View on GitHub
Data for CyberSOCEval, an LLM benchmark by Meta & CrowdStrike
☆22Sep 22, 2025Updated 10 months ago
SheltonLiu-N / Universal-Prompt-Injection
View on GitHub
The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".
☆73Oct 23, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
roywang021 / IDEATOR
View on GitHub
Code for ICCV2025 paper——IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves
☆18Jul 11, 2025Updated last year
MeteSertkan / ranger
View on GitHub
Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!
☆12Jun 12, 2023Updated 3 years ago
iai-group / sigir2018-table
View on GitHub
On-the-fly Table Generation - SIGIR'18
☆10Feb 1, 2020Updated 6 years ago
KutalVolkan / many-shot-jailbreaking-dataset
View on GitHub
Q&A dataset for many-shot jailbreaking
☆15Jul 19, 2024Updated 2 years ago
SmartDataAnalytics / Wikipedia_TF_IDF_Dataset
View on GitHub
Pre-computed IDF stats over all EN Wiki articles
☆13Jan 30, 2020Updated 6 years ago
Tele-EVOL / TeleAI-Safety
View on GitHub
☆27Jan 5, 2026Updated 6 months ago
mirzaeiyan / nqueens-genetic
View on GitHub
Solving the nqueens problem using genetic algorithm
☆12Dec 29, 2017Updated 8 years ago
D0miH / does-clip-know-my-face
View on GitHub
Source Code for the JAIR Paper "Does CLIP Know my Face?" (Demo: https://huggingface.co/spaces/AIML-TUDA/does-clip-know-my-face)
☆15Jul 9, 2024Updated 2 years ago
thongnt99 / lsr-multimodal
View on GitHub
ECIR 2024: Sparse lexical representation for image-text retrieval
☆13Jul 8, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AI45Lab / DEAN
View on GitHub
☆11Oct 25, 2024Updated last year
boudinfl / ir-using-kg
View on GitHub
Keyphrase Generation for Scientific Document Retrieval
☆11Oct 2, 2020Updated 5 years ago
mxzheng / TrojViT
View on GitHub
[CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang
☆15Jan 5, 2024Updated 2 years ago
simison / couchspinner
View on GitHub
Couchsurfing profile importer and previewer.
☆14Jan 8, 2023Updated 3 years ago
INESCTEC / kep
View on GitHub
Keyphase Extraction Package
☆10Aug 24, 2020Updated 5 years ago
ibm-granite / granite-guardian
View on GitHub
The Granite Guardian models are designed to detect risks in prompts and responses.
☆164May 5, 2026Updated 2 months ago
suzanv / PairwisePreferenceLearning
View on GitHub
Performs pairwise preference ranking for a given trainfile and testfile with binary class labels (1 and not 1). The binary classification…
☆14Jul 12, 2017Updated 9 years ago
chuhac / Reasoning-to-Defend
View on GitHub
[EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
☆12Aug 22, 2025Updated 11 months ago
ZrW00 / MuScleLoRA
View on GitHub
The code implementation of MuScleLoRA (Accepted in ACL 2024)
☆10Dec 1, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
rmit-ir / KrovetzStemmer
View on GitHub
Python binding to the KrovetzStemmer package (C++ version)
☆14Feb 12, 2023Updated 3 years ago
thunxxx / MLLM-Jailbreak-evaluation-MMJ-Bench
View on GitHub
☆81Mar 30, 2025Updated last year
oxai / visogender
View on GitHub
☆13May 10, 2025Updated last year
chaoqi7 / BSA-CIL-3D
View on GitHub
Boosting the Class-Incremental Learning in 3D Point Clouds via Zero-Collection-Cost Basic Shape Pre-Training
☆13Nov 30, 2024Updated last year
harpribot / harpreif
View on GitHub
Deep Learning - Visual Representation Learning by solving Jigsaw puzzles using Deep Reinforcement Learning
☆10Dec 8, 2016Updated 9 years ago
psunlpgroup / FoVer
View on GitHub
This repository includes code and materials for the paper "Efficient PRM Training Data Synthesis via Formal Verification" (ACL 2026 Findi…
☆18Apr 7, 2026Updated 3 months ago
begab / mamus
View on GitHub
Source code accompanying the ICLR2020 publication 'Massively Multilingual Sparse Word Representations' https://openreview.net/forum?id=Hy…
☆12Aug 15, 2023Updated 2 years ago
CryptoAILab / MergeGuard
View on GitHub
[CCS-LAMPS'24] LLM IP Protection Against Model Merging
☆16Oct 14, 2024Updated last year
gregdeon / spotlight
View on GitHub
Implementation of the spotlight: a method for discovering systematic errors in deep learning models
☆11Oct 5, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
AmenRa / indxr
View on GitHub
A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024.
☆23Nov 9, 2025Updated 8 months ago
LLMSmith / LLMSmith
View on GitHub
☆50Feb 26, 2025Updated last year
quadrismegistus / lltk
View on GitHub
Literary Language Toolkit: code, models, corpora, and web tools
☆11Jul 5, 2026Updated 2 weeks ago
allenai / safety-eval
View on GitHub
A simple evaluation of generative language models and safety classifiers.
☆105Jun 16, 2026Updated last month
WuraolaOyewusi / How-to-use-ScispaCy-for-Biomedical-Named-Entity-Recognition-Abbreviation-Resolution-and-link-UMLS
View on GitHub
☆10Aug 11, 2019Updated 6 years ago
facebookresearch / SecAlign
View on GitHub
Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"
☆98Jul 2, 2026Updated 3 weeks ago
jianlins / FastContext
View on GitHub
FastContext is an optimized Java implementation of ConText algorithm (https://www.ncbi.nlm.nih.gov/pubmed/23920642).
☆14Oct 19, 2021Updated 4 years ago