A Domain-Specific Language, Jailbreak Attack Synthesizer and Dynamic LLM Redteaming Toolkit
☆27Dec 5, 2024Updated last year
Alternatives and similar repositories for h4rm3l
Users that are interested in h4rm3l are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Nov 1, 2022Updated 3 years ago
- ☆10Nov 8, 2022Updated 3 years ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆20Oct 19, 2025Updated 5 months ago
- A Dataset and Results for Classifying Emotions Across Languages☆10Jun 20, 2021Updated 4 years ago
- Belief in the Machine: Investigating Epistemological Blind Spots of Language Models☆32Apr 19, 2025Updated 11 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [CVPR 2025] Official implementation for JOOD "Playing the Fool: Jailbreaking LLMs and Multimodal LLMs with Out-of-Distribution Strategy"☆22Jun 11, 2025Updated 9 months ago
- ☆17Feb 4, 2025Updated last year
- ☆14Jun 8, 2018Updated 7 years ago
- Python wrapper of axel, a light command line download accelerator☆10Mar 26, 2017Updated 9 years ago
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 6 months ago
- Automatically modelling and distilling knowledge within AI. In other words, summarising the AI research firehose.☆24Mar 15, 2019Updated 7 years ago
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…☆12Dec 1, 2021Updated 4 years ago
- CRFs based Chinese word segmentor☆21Oct 8, 2014Updated 11 years ago
- AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies☆29Aug 14, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆20Apr 12, 2024Updated last year
- Context-based Dialogue Act Recognition using Recurrent Neural Networks☆13Nov 13, 2021Updated 4 years ago
- Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"☆131Feb 24, 2025Updated last year
- ☆23Apr 5, 2023Updated 2 years ago
- Code for our ACL19 paper on argument generation☆14Nov 9, 2020Updated 5 years ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆21Apr 8, 2025Updated 11 months ago
- ☆24Oct 23, 2025Updated 5 months ago
- ☆16Sep 12, 2024Updated last year
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"☆14Aug 13, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" in NMI.☆57Nov 13, 2023Updated 2 years ago
- Shared code for training sentence embeddings with Flax / JAX☆28Jul 15, 2021Updated 4 years ago
- Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs"☆14Oct 10, 2022Updated 3 years ago
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion☆59Oct 1, 2025Updated 5 months ago
- Official Repository for EvalRS @ KDD 2023: a Rounded Evaluation of Recommender Systems☆30Feb 16, 2024Updated 2 years ago
- Implementation of "Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions"☆24Aug 27, 2024Updated last year
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍☆13Apr 15, 2025Updated 11 months ago
- Influence Estimation for Gradient-Boosted Decision Trees☆29May 27, 2024Updated last year
- This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…☆16Dec 9, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Simple image database demo application (built with django)☆10Mar 3, 2014Updated 12 years ago
- Data and Code for ACL 2024 paper "DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Docu…☆23Dec 21, 2024Updated last year
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey☆109Aug 7, 2024Updated last year
- Tech Writing bot at your service 🤖☆13Jan 6, 2023Updated 3 years ago
- ICLR 2022☆18Apr 15, 2022Updated 3 years ago
- NeurIPS'24 - LLM Safety Landscape☆39Oct 21, 2025Updated 5 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆34Apr 1, 2025Updated 11 months ago