[ICLR Workshop 2025] An official source code for paper "GuardReasoner: Towards Reasoning-based LLM Safeguards".
☆168May 19, 2025Updated 9 months ago
Alternatives and similar repositories for GuardReasoner
Users that are interested in GuardReasoner are comparing it to the libraries listed below
Sorting:
- Help you practice daily English speaking and conversation skills painlessly from easy to difficult☆64Apr 25, 2025Updated 10 months ago
- ⚡️A multilingual CI linter for eliminating language barriers in global development.☆32Dec 17, 2025Updated 2 months ago
- 手搓云计算运维开发 第一阶段私有云Dashboard 第二阶段CICD☆35Dec 19, 2024Updated last year
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆236Jun 14, 2025Updated 8 months ago
- A C++ implementation of Open Interpreter. / Open Interpreter 的 C++ 实现☆63Nov 13, 2025Updated 3 months ago
- Metrics for Go — lightweight, concurrent-safe, and with built-in support for exporting Counters, Gauges, and Timers to DataDog via DogSta…☆41Jun 8, 2025Updated 8 months ago
- ACL 2025 (Main) HiddenDetect: Detecting Jailbreak Attacks against Multimodal Large Language Models via Monitoring Hidden States☆159Jun 8, 2025Updated 8 months ago
- 利用Python实现的DBMS☆15May 16, 2023Updated 2 years ago
- Predict stock prices using Long Short-Term Memory (LSTM) networks.☆53Oct 19, 2023Updated 2 years ago
- a cli to initialize project.(React | Vue3 | lib)☆24Jan 23, 2025Updated last year
- A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]☆122May 17, 2025Updated 9 months ago
- Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, data…☆1,221Feb 6, 2026Updated 3 weeks ago
- Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …☆207Dec 15, 2025Updated 2 months ago
- Fast, stateless gateway with HMAC-based token auth, request-level tracing, and vector-ready logs.☆29May 13, 2025Updated 9 months ago
- Vim mode for VSCode, run Vim/Nvim in integrated terminal with seamless switching☆120Apr 30, 2025Updated 10 months ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆25Nov 29, 2024Updated last year
- [AAAI 2021] VMLoc: Variational Fusion For Learning-Based Multimodal Camera Localization☆31Oct 27, 2024Updated last year
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆26Feb 25, 2025Updated last year
- ☆104Jan 24, 2025Updated last year
- Backend Service of the Flow Orchestration Platform: an open-source and powerful workflow orchestration platform that is simple, user-frie…☆20Jul 16, 2023Updated 2 years ago
- A React-based virtual avatar component for real-time gameplay analysis and emotional support. Integrate with screen capture to provide in…☆149Jan 9, 2025Updated last year
- kotlin util collection☆20Mar 30, 2024Updated last year
- Python based Dex/Pancakeswap bot (GUI version), support multi wallets, intergated with Honeypot checker, approve, buy and sell function☆23Apr 19, 2023Updated 2 years ago
- ☆134Feb 15, 2025Updated last year
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated 9 months ago
- This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"☆54Feb 2, 2025Updated last year
- ☆98Mar 8, 2025Updated 11 months ago
- ☆144May 6, 2025Updated 9 months ago
- ☆80Jun 8, 2025Updated 8 months ago
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion☆58Oct 1, 2025Updated 5 months ago
- A header-only C++11 thread pool based on stack-allocated task containers☆80Jan 27, 2026Updated last month
- [NAACL 2025] SIUO: Cross-Modality Safety Alignment☆123Jan 31, 2025Updated last year
- This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among differe…☆91Apr 13, 2024Updated last year
- PhishIntention: Phishing detection through webpage intention☆255Jan 5, 2026Updated 2 months ago
- Repository of "Modal-NexT: toward unified heterogeneous cellular data integration"☆86Jun 16, 2025Updated 8 months ago
- A code repository designed to show the best GitHub has to offer.☆165Jun 30, 2024Updated last year
- Recipes to train the self-rewarding reasoning LLMs.☆231Mar 2, 2025Updated last year
- QRec is an algorithm that helps you quickly find the largest fixed-aspect, axis-aligned rectangle that can be inscribed in any given poly…☆27Jun 25, 2025Updated 8 months ago
- ☆19Apr 26, 2025Updated 10 months ago