☆12Sep 29, 2024Updated last year
Alternatives and similar repositories for STAIR-LLMGuardrails
Users that are interested in STAIR-LLMGuardrails are comparing it to the libraries listed below.
- Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).☆16Jul 4, 2024Updated last year
- PyTorch Implementation of Meta Attack via Contrastive Surrogate Objective☆12May 21, 2024Updated last year
- Prompt Leaking Attack☆26Jan 28, 2026Updated last month
- LLM Self Defense: By Self Examination, LLMs know they are being tricked☆51May 21, 2024Updated last year
- JailBench: A Chinese dataset for evaluating jailbreak attack risks in large language models [PAKDD 2025]☆170Mar 3, 2025Updated last year
- EaTVul: ChatGPT-based Evasion Attack Against Software Vulnerability Detection☆18Jan 6, 2025Updated last year
- Red Queen Dataset and data generation template☆27Dec 26, 2025Updated 3 months ago
- [CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment☆27Jun 11, 2025Updated 9 months ago
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting☆20Mar 25, 2024Updated 2 years ago
- Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector☆11Jun 24, 2023Updated 2 years ago
- SecureDNA client and server components monorepo☆16Oct 20, 2025Updated 5 months ago
- The most comprehensive and accurate LLM jailbreak attack benchmark by far☆22Mar 22, 2025Updated last year
- Implementation of paper 'Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing'☆23Jun 9, 2024Updated last year
- ☆13Feb 21, 2025Updated last year
- User Interface Design & Evaluation☆11Dec 17, 2018Updated 7 years ago
- An easy-to-use Python framework to defend against jailbreak prompts.☆21Mar 22, 2025Updated last year
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- Implementation for "RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content"☆23Jul 28, 2024Updated last year
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- SecProbe: A task-driven system for evaluating the security capabilities of large language models☆15Nov 29, 2024Updated last year
- ☆25Jun 16, 2024Updated last year
- A curated collection of papers and related projects on using LLMs for privacy.☆26Oct 8, 2025Updated 5 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆98May 23, 2024Updated last year
- ☆12May 19, 2021Updated 4 years ago
- Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses v…☆12Oct 30, 2023Updated 2 years ago
- One prompt, three pipelines: Flow / CAD / PPT Flow: Conversational flowchart generation with patch / replace CAD: Interior design → ana…☆70Mar 19, 2026Updated last week
- Eagle is a Web Application Attack and Audit Framework. Eagle has moved to Bitbucket.☆11Nov 21, 2016Updated 9 years ago
- Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos☆12Oct 8, 2020Updated 5 years ago
- How to write an academic paper☆11Oct 20, 2022Updated 3 years ago
- NVIDIA’s repository for enabling trustworthy AI.☆28Mar 3, 2026Updated 3 weeks ago
- Face detection and recognition☆24Mar 12, 2015Updated 11 years ago
- ☆41Dec 9, 2025Updated 3 months ago
- ☆18Sep 2, 2025Updated 6 months ago
- POINTS-Reader train☆20Sep 20, 2025Updated 6 months ago
- ☆34Nov 12, 2024Updated last year
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24May 5, 2025Updated 10 months ago
- Code for paper "Defending against LLM Jailbreaking via Backtranslation"☆34Aug 16, 2024Updated last year
- Web Application Honeypot☆15May 12, 2013Updated 12 years ago
- Playing around with various jailbreaking techniques ahead of the Gray Swan AI Ultimate Jailbreaking Competition☆18Oct 6, 2024Updated last year