Code for the website www.jailbreakchat.com
☆124 · Aug 26, 2023 · Updated 2 years ago
Alternatives and similar repositories for jailbreakchat
Users that are interested in jailbreakchat are comparing it to the libraries listed below.
- A set of functions for well-known Cumulative Distribution Function (CDF)-based distance measure ☆15 · Jan 5, 2024 · Updated 2 years ago
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport ☆13 · Nov 19, 2023 · Updated 2 years ago
- [ICML 2025] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping". ☆175 · May 2, 2025 · Updated last year
- Code for paper: Are Large Language Models Post Hoc Explainers? ☆34 · Jul 22, 2024 · Updated last year
- ☆34 · Nov 26, 2024 · Updated last year
- The official implementation of Self-aware Object Detection [CVPR 2023] ☆13 · Jun 30, 2023 · Updated 2 years ago
- Collection of scripts for preparation of datasets for semantic segmentation of UAV images ☆15 · Jun 21, 2022 · Updated 4 years ago
- ☆41 · May 25, 2024 · Updated 2 years ago
- A curated list of explainability-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to… ☆59 · Jun 25, 2025 · Updated last year
- Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning" ☆22 · May 6, 2025 · Updated last year
- ☆33 · Jun 24, 2024 · Updated 2 years ago
- [ICCV 2023] HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness ☆17 · Sep 28, 2023 · Updated 2 years ago
- ☆747 · Jul 2, 2025 · Updated 11 months ago
- Build a level 1 coding agent. ☆17 · Jan 28, 2025 · Updated last year
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025] ☆388 · Jan 23, 2025 · Updated last year
- Official Implementation of the paper: "A Rate-Distortion View of Uncertainty Quantification", ICML 2024 ☆28 · Sep 3, 2024 · Updated last year
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models. ☆15 · Dec 16, 2024 · Updated last year
- Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025] ☆79 · Jan 23, 2025 · Updated last year
- ☆60 · Jun 5, 2024 · Updated 2 years ago
- Debiasing Through Data Attribution ☆13 · May 23, 2024 · Updated 2 years ago
- ☆53 · Feb 8, 2025 · Updated last year
- Evaluation of neuro-symbolic engines ☆42 · Aug 3, 2024 · Updated last year
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion ☆61 · Oct 1, 2025 · Updated 8 months ago
- Implementation of BEAST adversarial attack for language models (ICML 2024) ☆88 · May 14, 2024 · Updated 2 years ago
- ⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs ☆484 · Jan 31, 2024 · Updated 2 years ago
- This repository provides a benchmark for prompt injection attacks and defenses in LLMs ☆463 · Oct 29, 2025 · Updated 8 months ago
- ☆11 · Jun 1, 2026 · Updated 3 weeks ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving. ☆50 · Apr 22, 2026 · Updated 2 months ago
- Quickly get custom prompt contexts ☆14 · May 29, 2026 · Updated last month
- A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack–Defense Evaluation ☆74 · May 8, 2026 · Updated last month
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound … ☆33 · Mar 30, 2026 · Updated 2 months ago
- Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks (Sun et al, 2021) ☆13 · Oct 16, 2023 · Updated 2 years ago
- Implementation of stop sequencer for Huggingface Transformers ☆16 · Jun 6, 2023 · Updated 3 years ago
- A package that achieves 95%+ transfer attack success rate against GPT-4 ☆26 · Oct 24, 2024 · Updated last year
- Winning Hackathon entry for Streamlit LLM Hackathon October 2023 ☆16 · Oct 19, 2023 · Updated 2 years ago
- ☆143 · Jul 7, 2025 · Updated 11 months ago
- ☆131 · May 31, 2024 · Updated 2 years ago
- Postfix SMTP Smuggling - Expect Script POC ☆23 · Dec 26, 2023 · Updated 2 years ago
- Code for the ICCV 2025 paper "IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves" ☆18 · Jul 11, 2025 · Updated 11 months ago