whitecircle/circle-guard-bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/whitecircle/circle-guard-bench)

whitecircle / circle-guard-bench

First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and safeguards)

☆70

Alternatives and similar repositories for circle-guard-bench

Users that are interested in circle-guard-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DeevsDeevs / agent-system
View on GitHub
Just a set of claude/codex skills to be 10x Deevs' engineer
☆40Mar 20, 2026Updated 4 months ago
advpropsys / tinymem
View on GitHub
Self hosted Claude Code shared memory & artifact storage in ~1k LOC.
☆22Jan 10, 2026Updated 6 months ago
VikhrModels / DOoM
View on GitHub
Бенчмарк для оценки способности языковых моделей решать математические и физические задачи на русском языке
☆22Nov 14, 2025Updated 8 months ago
d9d-project / d9d
View on GitHub
d9d - d[istribute]d - distributed training framework based on PyTorch that tries to be efficient yet hackable
☆26Updated this week
robomotic / awesome-guide-ai-safety
View on GitHub
☆13Jun 7, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Lercas / MLSec-Docs
View on GitHub
MlSec document RU
☆17Nov 9, 2025Updated 8 months ago
humane-intelligence / ai_village_defcon_grt_data
View on GitHub
☆15Jun 7, 2024Updated 2 years ago
DeevsDeevs / dotfiles
View on GitHub
Deevs' dotfiles
☆20Updated this week
dayyass / language-modeling
View on GitHub
Pipeline for training Language Models using PyTorch.
☆12May 24, 2022Updated 4 years ago
KonderLip / data-fusion2024-geo
View on GitHub
Open solution for geo task from Data Fusion Contest 2024
☆14Mar 1, 2024Updated 2 years ago
dreadnode / example-agents
View on GitHub
Example agents for the Dreadnode platform
☆34Dec 19, 2025Updated 7 months ago
Blucknote / Kandinsky-advanced-notebooks
View on GitHub
Notebooks with additional features to run Kandinsky
☆14May 15, 2023Updated 3 years ago
VikhrModels / Salt
View on GitHub
☆60Dec 17, 2025Updated 7 months ago
weizeming / momentum-attack-llm
View on GitHub
☆25Jan 17, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
RapidResponseBench / rapidresponsebench
View on GitHub
☆35Nov 12, 2024Updated last year
VikhrModels / effective_llm_alignment
View on GitHub
Effective LLM Alignment Toolkit
☆153Jun 25, 2025Updated last year
NaturalCycles / MixBABA
View on GitHub
A tool for making AB tests with Mixpanel API
☆12Jan 25, 2019Updated 7 years ago
lanesket / llm.log
View on GitHub
Know what you spend, see what you send. Lightweight local proxy that logs every LLM call - costs, tokens, full prompts and responses.
☆18Apr 4, 2026Updated 3 months ago
NeuralPushkin / Dalle2-Decoder
View on GitHub
Dalle2-Decoder for image generation tasks
☆19May 19, 2022Updated 4 years ago
RussianNLP / DRAGON
View on GitHub
RAG benchmark
☆32Feb 6, 2026Updated 5 months ago
ClawGym / ClawGym-Bench
View on GitHub
☆18May 15, 2026Updated 2 months ago
Bots-Avatar / ExplainitAll
View on GitHub
ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…
☆19Oct 11, 2024Updated last year
iamtrask / decentralized-ai-from-scratch
View on GitHub
☆19Apr 8, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
haizelabs / bijection-learning
View on GitHub
☆29Oct 22, 2024Updated last year
aniket-work / AI_Powered_Dev_Search_Engine
View on GitHub
AI_Powered_Dev_Search_Engine
☆12Mar 10, 2024Updated 2 years ago
DS3Lab / CocktailSGD
View on GitHub
☆27Aug 25, 2023Updated 2 years ago
Aloriosa / srmt
View on GitHub
The original Shared Recurrent Memory Transformer implementation
☆36Jul 11, 2025Updated last year
princeton-polaris-lab / Evaluating-Durable-Safeguards
View on GitHub
[ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs
☆13Jun 20, 2025Updated last year
OWASP / www-project-llm-verification-standard
View on GitHub
OWASP LLM Security Verification Standard
☆57May 11, 2026Updated 2 months ago
IlyaGusev / codearkt
View on GitHub
Implementation of the CodeAct agentic framework with Docker containers for security, MCP servers for tool integrations, and multi-agent s…
☆40Oct 22, 2025Updated 9 months ago
xiamengzhou / training_trajectory_analysis
View on GitHub
[ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
☆25Nov 14, 2023Updated 2 years ago
lakeraai / dsec-gandalf
View on GitHub
☆24Mar 18, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
LautaroParada / variance-test
View on GitHub
A Python package for variance ratio testing, weak-form efficiency diagnostics, rolling-window analysis, and simulation-based research on …
☆13Apr 10, 2026Updated 3 months ago
edgeimpulse / example-custom-ml-block-keras
View on GitHub
Custom Keras ML block example for Edge Impulse
☆12Updated this week
jotaf98 / shareddataset
View on GitHub
A PyTorch Dataset that caches samples in shared memory, accessible globally to all processes
☆25May 11, 2022Updated 4 years ago
isadrtdinov / bootcamp-idao-2022
View on GitHub
IDAO 2022: Machine Learning Bootcamp
☆19Dec 4, 2021Updated 4 years ago
jerryjliu / classify_extract_sec
View on GitHub
☆16Nov 9, 2025Updated 8 months ago
Meirtz / BabyBLUE-llm
View on GitHub
[COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…
☆12Jul 26, 2024Updated 2 years ago
shehper / AC-Solver
View on GitHub
A long-horizon, sparse-reward math environment for reinforcement learning. Official code repo for "What makes Math problems hard for rein…
☆36Aug 11, 2025Updated 11 months ago