ReasoningShield: Safety Detection over Reasoning Traces of Large Reasoning Models
☆25Sep 27, 2025Updated 5 months ago
Alternatives and similar repositories for ReasoningShield
Users that are interested in ReasoningShield are comparing it to the libraries listed below.
- RSeata - A Rust implementation of distributed transaction framework, supporting AT & XA modes, SeaORM integration, and gRPC-based context…☆86Mar 10, 2026Updated 2 weeks ago
- Dual-Level Cross-Modality Neural Architecture Search for Guided Image Super-Resolution (TPAMI)☆50Nov 5, 2025Updated 4 months ago
- gauss-awesome-recommender-system-engine☆122Oct 6, 2025Updated 5 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- On the Robustness of GUI Grounding Models Against Image Attacks☆12Apr 8, 2025Updated 11 months ago
- GOJ Online Judge System☆104Oct 30, 2025Updated 4 months ago
- A repository for evaluating large language models as raters in large-scale writing assessments, focusing on a psychometric framework for …☆82Jan 26, 2025Updated last year
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆282Updated this week
- ☆49Feb 25, 2026Updated last month
- Official Repository for Can Language Models be Instructed to Protect Personal Information?☆13Oct 8, 2023Updated 2 years ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆40Dec 1, 2025Updated 3 months ago
- A system that turns jailbreak papers into runnable attacks and benchmarks — live, as research evolves.☆24Mar 12, 2026Updated 2 weeks ago
- Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning☆187Mar 4, 2026Updated 3 weeks ago
- Full life cycle cross providers serverless application management for your fast-growing business.☆87Updated this week
- ☆14May 1, 2023Updated 2 years ago
- Optimizing Review Generation Through Prompt Generation☆15Apr 15, 2024Updated last year
- A modern terminal application built with Vue.js and Tauri.☆50Feb 13, 2026Updated last month
- ☆41Oct 12, 2025Updated 5 months ago
- Scaling Agentic Environments Automatically.☆55Jan 22, 2026Updated 2 months ago
- Monitor LibertyCat NFT marketplace events — automatically track new listings and sales, and get instant email notifications.☆184Oct 27, 2025Updated 4 months ago
- An experimental edge key-value database built on top of FoundationDB.☆11Jan 9, 2025Updated last year
- PRG's Software and Hardware Framework for Quadrotors☆14Mar 24, 2021Updated 5 years ago
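- 📚 TG-EDU comprehensive education platform | Supports assignment submission 📝, batch grading ✅, late-submission requests 🔄, team collaboration 👥, and grade statistics 📊☆110Mar 14, 2026Updated last week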
- 🎁 Modern e-commerce system built with Go (Gin + Gorm + Redis + JWT). Enhanced version of yshop-gin with improved UI, performance and fea…☆37Oct 17, 2025Updated 5 months ago
- Algorithms for The Travelling Salesman Problem☆13May 12, 2022Updated 3 years ago
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆52Oct 14, 2025Updated 5 months ago
- A JetBrains plugin: Tree Description. Adds custom notes to project modules, with color categories and purpose labels, and lets you share open-source mappings.☆212Jan 26, 2026Updated 2 months ago
- Astron-xmod-shim — Lightweight, declarative middleware for reliably converging AI service workloads.☆101Nov 3, 2025Updated 4 months ago
- ☆22Jan 29, 2026Updated last month
- A single WebTorrent client shared by all web pages and workers☆33Aug 19, 2022Updated 3 years ago
- FPGA Low latency 10GBASE-R PCS☆12May 23, 2023Updated 2 years ago
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆29Apr 2, 2025Updated 11 months ago
- A curated list of scientific and interdisciplinary research on AI existential risks, especially in the era of large models.☆14Jul 26, 2024Updated last year
- Fish bot for the MMORPG World of Warcraft☆10Aug 3, 2018Updated 7 years ago
- Distributed, consistent and highly available key-value storage system (using LevelDB) based on Raft consensus algorithm.☆10Apr 14, 2016Updated 9 years ago
- Official PyTorch implementation of our paper "Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World A…☆11Feb 8, 2023Updated 3 years ago
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆36Nov 11, 2025Updated 4 months ago
- Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings☆19Sep 1, 2025Updated 6 months ago
- [ICLR 2026] InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents☆128Feb 5, 2026Updated last month