The jailbreak-evaluation is an easy-to-use Python package for language model jailbreak evaluation.
☆27Nov 4, 2024Updated last year
Alternatives and similar repositories for jailbreak-evaluation
Users that are interested in jailbreak-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Adaptive Verification of Patches at the Binary Level☆14Mar 19, 2026Updated 2 months ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- [TMLR 2025] Official implementation of AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation☆26Jun 17, 2025Updated 11 months ago
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆14Feb 21, 2025Updated last year
- Web2 bug bounty Agent Skill — evidence-based, no AI slop. Covers 18 vulnerability classes across HackerOne, Bugcrowd, Intigriti, and YesW…☆51Feb 21, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.…☆28Sep 25, 2024Updated last year
- valve source engine hooking on OS X using libembryo, no sdk required☆10Sep 21, 2016Updated 9 years ago
- ☆19Jul 7, 2025Updated 11 months ago
- Multi-head Recurrent Layer Attention for Vision Network☆23Mar 2, 2023Updated 3 years ago
- The repo of the Doc2SoarGraph framework☆10Sep 17, 2024Updated last year
- A benchmark dataset for evaluating dialog system and natural language generation metrics.☆39Jun 13, 2022Updated 3 years ago
- 绝地求生的数据抓取☆17Sep 25, 2017Updated 8 years ago
- ☆10Jan 15, 2018Updated 8 years ago
- Transfer Learning in Dialogue Benchmarking Toolkit☆14Mar 31, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Feb 22, 2023Updated 3 years ago
- ☆26Mar 4, 2022Updated 4 years ago
- ☆12Sep 23, 2024Updated last year
- 🎙️ 一个全自动的学术论文播客生成系统,支持从arXiv网站爬取最新科技资讯,使用LLM生成结构化对话脚本,并通过语音合成技术输出专业的播客音频。集新闻采集、内容生成、语音合成于一体的AI播客工具。☆24Nov 1, 2024Updated last year
- Slides and videos from talks given at cons☆26Jun 19, 2025Updated 11 months ago
- Generate security policies and documents based on KPNs templates.☆41Oct 7, 2019Updated 6 years ago
- NAACL 2022 paper on Analyzing Modality Robustness in Multimodal Sentiment Analysis☆31Jan 21, 2023Updated 3 years ago
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- Working through the exercises from Seven Languages In Seven Weeks☆27Jun 15, 2013Updated 12 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- collection of fonts for Uyghur arabic script☆14Feb 4, 2019Updated 7 years ago
- Minor changes to Richard Socher's Recursive Autoenocder code to work with GNU Octave. (http://www.socher.org/index.php/Main/DynamicPoolin…☆16Feb 27, 2014Updated 12 years ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Aug 9, 2024Updated last year
- Official implementation of ECML PKDD'24 paper 'Self-Supervised Spatial-Temporal Normality Learning for Time Series Anomaly Detection'.☆18Aug 17, 2024Updated last year
- ☆10Jun 24, 2024Updated last year
- source code of paper "Mapping to Bits: Efficiently Detecting Type Confusion Errors"☆14Dec 23, 2018Updated 7 years ago
- Unofficial pytorch implementation of IPOT for improved Seq2Seq Learning☆14Dec 4, 2021Updated 4 years ago
- Synthetic Data Generation for Evaluation☆14Feb 21, 2025Updated last year
- ☆15Dec 8, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆42Apr 14, 2020Updated 6 years ago
- JavaScript files used to bypass Root Detection & SSL Pinning in Frida.☆15Sep 12, 2022Updated 3 years ago
- Sukoshi is a proof-of-concept Python/C++ implant that leverages the MQTT protocol for C2 and uses AWS IoT Core as infrastructure.☆49Mar 26, 2022Updated 4 years ago
- implement n2nmn with pytorch☆19Apr 10, 2019Updated 7 years ago
- Neovim Bun client.☆33Jun 3, 2026Updated last week
- ☆20Jan 27, 2026Updated 4 months ago
- Safety-J: Evaluating Safety with Critique☆16Jul 28, 2024Updated last year