yueliu1999/GuardReasoner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yueliu1999/GuardReasoner)

yueliu1999 / GuardReasoner

[ICLR Workshop 2025] An official source code for paper "GuardReasoner: Towards Reasoning-based LLM Safeguards".

☆175

Alternatives and similar repositories for GuardReasoner

Users that are interested in GuardReasoner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yueliu1999 / Awesome-Efficient-Inference-for-LRMs
View on GitHub
[IEEE T-PAMI] Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Larg…
☆238Jun 13, 2026Updated last month
2001wjh / ChatMaster
View on GitHub
Help you practice daily English speaking and conversation skills painlessly from easy to difficult
☆63Apr 25, 2025Updated last year
yueliu1999 / GuardReasoner-VL
View on GitHub
[NeurIPS 2025] An official source code for paper "GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning".
☆123Feb 22, 2026Updated 5 months ago
xiegangqingnian1021 / devops
View on GitHub
手搓云计算运维开发第一阶段私有云Dashboard 第二阶段CICD
☆35Dec 19, 2024Updated last year
kangmintong / R-2-Guard
View on GitHub
[ICLR 2025] Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning
☆23Jul 8, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ystemsrx / code-atlas
View on GitHub
A C++ implementation of Open Interpreter. / Open Interpreter 的 C++ 实现
☆63Nov 13, 2025Updated 8 months ago
wy-z / vscode-vim-mode
View on GitHub
Vim mode for VSCode, run Vim/Nvim in integrated terminal with seamless switching
☆121Apr 30, 2025Updated last year
Ray7788 / Stock-Price-Forecast
View on GitHub
Predict stock prices using Long Short-Term Memory (LSTM) networks.
☆53Oct 19, 2023Updated 2 years ago
0xjeffro / sentrix
View on GitHub
Fast, stateless gateway with HMAC-based token auth, request-level tracing, and vector-ready logs.
☆30May 13, 2025Updated last year
chuhac / Reasoning-to-Defend
View on GitHub
[EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
☆12Aug 22, 2025Updated 10 months ago
leigest519 / HiddenDetect
View on GitHub
ACL 2025 (Main) HiddenDetect: Detecting Jailbreak Attacks against Multimodal Large Language Models via Monitoring Hidden States
☆165Jun 8, 2025Updated last year
JusperLee / AudioTrust
View on GitHub
AudioTrust: Benchmarking the Multi-faceted Trustworthiness of Audio Large Language Models
☆215Jan 28, 2026Updated 5 months ago
rainbowyuyu / manim_extend_rainbow
View on GitHub
Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …
☆206Dec 15, 2025Updated 7 months ago
yakiisama / taco-launch
View on GitHub
a cli to initialize project.(React | Vue3 | lib)
☆24Jan 23, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
s3ndd / sen-graphql-go
View on GitHub
☆80Jun 8, 2025Updated last year
yueliu1999 / Awesome-Jailbreak-on-LLMs
View on GitHub
Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, data…
☆1,536Jun 7, 2026Updated last month
hkvincent / vpeodometer
View on GitHub
the pedometer with excitation system
☆30Oct 29, 2021Updated 4 years ago
wenlongliaoEE / loadforecast
View on GitHub
☆105Jan 24, 2025Updated last year
HanjiangHu / NBF-LLM
View on GitHub
The official code for "Steering Dialogue Dynamics for Robustness against Multi-turn Jailbreaking Attacks".
☆18Jun 24, 2026Updated 3 weeks ago
yizems / KUtil
View on GitHub
kotlin util collection
☆20Mar 30, 2024Updated 2 years ago
YesuLabs / contracts
View on GitHub
☆98Mar 8, 2025Updated last year
microsoft / MMLU-CF
View on GitHub
A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]
☆125May 17, 2025Updated last year
s3ndd / gometricus
View on GitHub
Metrics for Go — lightweight, concurrent-safe, and with built-in support for exporting Counters, Gauges, and Timers to DataDog via DogSta…
☆41Jun 8, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xw-an / arcade-x6-api
View on GitHub
Backend Service of the Flow Orchestration Platform: an open-source and powerful workflow orchestration platform that is simple, user-frie…
☆20Jul 16, 2023Updated 3 years ago
Irreel / AnyActions
View on GitHub
☆132Feb 15, 2025Updated last year
lava-security-research / forge-framework
View on GitHub
Top 10 Data Centers & AI Infrastructure Security Risks
☆16Updated this week
Ubheee / chainainexus-cloud
View on GitHub
☆19Apr 26, 2025Updated last year
AI45Lab / CodeAttack
View on GitHub
[ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion
☆61Oct 1, 2025Updated 9 months ago
rps-agents / agi-game-live
View on GitHub
A React-based virtual avatar component for real-time gameplay analysis and emotional support. Integrate with screen capture to provide in…
☆148Jan 9, 2025Updated last year
blain3white / next-fast-table
View on GitHub
☆90Jul 23, 2024Updated last year
garlic-byte / RL-LLM
View on GitHub
强化学习-大语言模型
☆68Jun 17, 2025Updated last year
corescriptions / indexer
View on GitHub
Inscriptions on CoreDao, powered by Insdexer.
☆147Mar 20, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Crypto-KK / Pancakeswap-bot
View on GitHub
Python based Dex/Pancakeswap bot (GUI version), support multi wallets, intergated with Honeypot checker, approve, buy and sell function
☆23Apr 19, 2023Updated 3 years ago
orchain / prysm
View on GitHub
☆296Sep 14, 2025Updated 10 months ago
Nonac / LXD_Build
View on GitHub
This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among differe…
☆91Apr 13, 2024Updated 2 years ago
s3ndd / cryptor
View on GitHub
`cryptor` is a Go package for secure encryption and decryption using NaCl's `secretbox` from `golang.org/x/crypto`
☆60Jun 8, 2025Updated last year
dubbenexus / insmess-speech
View on GitHub
即迅语音识别服务，支持语音识别（ASR）、语音合成（TTS）、声纹识别（VPR）等功能，适配国产化arm操作系统，支持CPU快速语音识别
☆74Jul 15, 2024Updated 2 years ago
Credit-card-monitoring-and-fraud-check / Credit_card_monitoring_and_check
View on GitHub
A code repository designed to show the best GitHub has to offer.
☆165Jun 30, 2024Updated 2 years ago
risesoft-y9 / Email
View on GitHub
电子邮件是一款简化的具备邮件服务器的企业邮箱，支持在将其他主流邮箱的邮件进行导入后自主控制邮件数据安全。电子邮件具备较为简洁的界面风格，以其简洁精确的功能和小巧安全的架构便于企业和政府根据业务要求进行二次开发。电子邮件需要依赖开源的数字底座进行人员岗位管控。
☆370Jul 8, 2026Updated 2 weeks ago