AI45Lab/AgentDoG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AI45Lab/AgentDoG)

AI45Lab / AgentDoG

A Diagnostic Guardrail Framework for AI Agent Safety and Security

☆659

Alternatives and similar repositories for AgentDoG

Users that are interested in AgentDoG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LiYu0524 / ATbench
View on GitHub
ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis
☆33Updated this week
shuyuehu / anti-laodeng
View on GitHub
anti-老登，反登味的飞书机器人。拒绝内耗，从我做起，让职场再无登味
☆28Apr 14, 2026Updated 3 months ago
domaineval / DomainEval
View on GitHub
DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …
☆13Dec 12, 2024Updated last year
Nebularaid2000 / rethink_sft_generalization
View on GitHub
Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"
☆108Apr 23, 2026Updated 2 months ago
AI45Lab / RelayRadar
View on GitHub
☆112Jun 24, 2026Updated 3 weeks ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
AI45Lab / SAfactory
View on GitHub
SAfactory: A Scalable Agentic Infrastructure for Training Trustworthy Autonomous Intelligence
☆154Updated this week
xsddys / TRACE
View on GitHub
TRACE, a framework for turn-aware credit assignment for multi-turn jailbreak optimization
☆19Jun 22, 2026Updated 3 weeks ago
AI45Lab / DEAN
View on GitHub
☆11Oct 25, 2024Updated last year
AI45Lab / Fake-Alignment
View on GitHub
☆17Mar 22, 2024Updated 2 years ago
AI45Lab / DeepScan
View on GitHub
Diagnostic Framework for LLMs and MLLMs
☆39Mar 2, 2026Updated 4 months ago
vlm2-bench / VLM2-Bench
View on GitHub
VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
☆45May 20, 2025Updated last year
lichengliu03 / unary-feedback
View on GitHub
☆44Mar 31, 2026Updated 3 months ago
aisa-group / skill-inject
View on GitHub
Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks
☆88Jul 1, 2026Updated last week
Junjie-Ye / MulDimIF
View on GitHub
[ACL 2026] A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models
☆23Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
AI45Lab / MLLMGuard
View on GitHub
☆46Jun 19, 2025Updated last year
ChnQ / TracingLLM
View on GitHub
☆30May 22, 2024Updated 2 years ago
AI45Lab / CodeAttack
View on GitHub
[ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion
☆61Oct 1, 2025Updated 9 months ago
thu-coai / BARREL
View on GitHub
[ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs
☆18May 21, 2025Updated last year
AI4Good24 / PsySafe
View on GitHub
☆53Feb 8, 2025Updated last year
song2yu / Mono2Stereo
View on GitHub
[CVPR25] Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
☆70May 19, 2026Updated last month
JiayuJeff / CostBench
View on GitHub
The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments…
☆32Jun 14, 2026Updated last month
ynulihao / AgentSkillOS
View on GitHub
Build your agent from 200,000+ skills via skill RETRIEVAL & ORCHESTRATION
☆550Mar 7, 2026Updated 4 months ago
pzs19 / LEMMA
View on GitHub
☆16Sep 4, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sxswz213 / DeepSlides
View on GitHub
☆27Apr 9, 2026Updated 3 months ago
AI45Lab / iDeer
View on GitHub
这倒是提醒我了
☆401May 12, 2026Updated 2 months ago
AI45Lab / OpenRT
View on GitHub
Open-source red teaming framework for MLLMs with 42+ attack methods
☆257Mar 25, 2026Updated 3 months ago
AI45Lab / VLSBench
View on GitHub
[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
☆62Jul 21, 2025Updated 11 months ago
Fu-Dayuan / AgentRefine
View on GitHub
(ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning
☆20Nov 22, 2025Updated 7 months ago
falonss703 / Awesome-Uncertainty-based-Reinforcement-Learning
View on GitHub
🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL
☆58Aug 24, 2025Updated 10 months ago
X1AOX1A / Word2World
View on GitHub
[ACL 2026 Oral] From Word to World: Can Large Language Models be Implicit Text-based World Models?
☆66Apr 13, 2026Updated 3 months ago
Zavianx / vela
View on GitHub
Velaclaw — control plane for team AI
☆122Updated this week
stanford-crfm / air-bench-2024
View on GitHub
AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies
☆30Aug 14, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ai-agents-2030 / ViMo
View on GitHub
☆23Apr 2, 2026Updated 3 months ago
WINDGAND / Bailin
View on GitHub
60 秒造人，请上桌面 —— 开源本地 AI 桌宠，用 ta 的视角陪你看问题
☆55Updated this week
zzh-thu-22 / ExtendAttack
View on GitHub
[AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".
☆25Mar 18, 2026Updated 3 months ago
thu-coai / Backdoor-Data-Extraction
View on GitHub
☆33May 22, 2025Updated last year
ynulihao / LLMRouterBench
View on GitHub
[Findings@ACL'26] LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing
☆78Apr 6, 2026Updated 3 months ago
euxcet / thulearn2018
View on GitHub
Tools for Web Learning of Tsinghua University.
☆10Sep 17, 2024Updated last year
Eyr3 / TextCRS
View on GitHub
Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks (IEEE S&P 2024)
☆35Jun 29, 2025Updated last year