☆12Sep 29, 2024Updated last year
Alternatives and similar repositories for STAIR-LLMGuardrails
Users that are interested in STAIR-LLMGuardrails are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).☆16Jul 4, 2024Updated last year
- Pytorch Implmentation of Meta Attack via Contrastive Surrogate Objective☆12May 21, 2024Updated last year
- 提示词泄露攻击(Prompt Leaking Attack)☆26Jan 28, 2026Updated 2 months ago
- LLM Self Defense: By Self Examination, LLMs know they are being tricked☆51May 21, 2024Updated last year
- JailBench:大型语言模型越狱攻击风险评测中文数据集 [PAKDD 2025]☆171Mar 3, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- EaTVul: ChatGPT-based Evasion Attack Against Software Vulnerability Detection☆18Jan 6, 2025Updated last year
- Red Queen Dataset and data generation template☆26Dec 26, 2025Updated 3 months ago
- [CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment☆27Jun 11, 2025Updated 10 months ago
- Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector☆11Jun 24, 2023Updated 2 years ago
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting☆20Mar 25, 2024Updated 2 years ago
- SecureDNA client and server components monorepo☆17Oct 20, 2025Updated 5 months ago
- The most comprehensive and accurate LLM jailbreak attack benchmark by far☆21Mar 22, 2025Updated last year
- Implementation of paper 'Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing'☆24Jun 9, 2024Updated last year
- ☆13Feb 21, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- User Interface Design & Evaluation☆11Dec 17, 2018Updated 7 years ago
- An easy-to-use Python framework to defend against jailbreak prompts.☆21Mar 22, 2025Updated last year
- Implementation for "RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content"☆23Jul 28, 2024Updated last year
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- SecProbe:任务驱动式大模型安全能力 评测系统☆15Nov 29, 2024Updated last year
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- ☆25Jun 16, 2024Updated last year
- A curated collection of papers and related projects on using LLMs for privacy.☆29Oct 8, 2025Updated 6 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆97May 23, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆12May 19, 2021Updated 4 years ago
- One prompt, three pipelines: Flow / CAD / PPT Flow: Conversational flowchart generation with patch / replace CAD: Interior design → ana…☆69Mar 24, 2026Updated 3 weeks ago
- Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses v…☆12Oct 30, 2023Updated 2 years ago
- [Neurips’25] Code for the paper "Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization"☆31Sep 25, 2025Updated 6 months ago
- Eagle is a Web Application Attack and Audit Framework. Eagle has moved to Bitbucket.☆11Nov 21, 2016Updated 9 years ago
- Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos☆12Oct 8, 2020Updated 5 years ago
- How to write an academic paper☆11Oct 20, 2022Updated 3 years ago
- Face detection and recognition☆24Mar 12, 2015Updated 11 years ago
- ☆41Dec 9, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆18Sep 2, 2025Updated 7 months ago
- POINTS-Reader train☆20Sep 20, 2025Updated 6 months ago
- NVIDIA’s repository for enabling trustworthy AI.☆31Apr 7, 2026Updated last week
- ☆34Nov 12, 2024Updated last year
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24May 5, 2025Updated 11 months ago
- Code for paper "Defending aginast LLM Jailbreaking via Backtranslation"☆34Aug 16, 2024Updated last year
- Webapplication Honeypot☆16May 12, 2013Updated 12 years ago