A Domain-Specific Language, Jailbreak Attack Synthesizer and Dynamic LLM Redteaming Toolkit
☆27Dec 5, 2024Updated last year
Alternatives and similar repositories for h4rm3l
Users that are interested in h4rm3l are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Nov 1, 2022Updated 3 years ago
- ☆10Nov 8, 2022Updated 3 years ago
- ☆12Oct 25, 2023Updated 2 years ago
- A Dataset and Results for Classifying Emotions Across Languages☆10Jun 20, 2021Updated 4 years ago
- Belief in the Machine: Investigating Epistemological Blind Spots of Language Models☆32Apr 19, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2025] Official implementation for JOOD "Playing the Fool: Jailbreaking LLMs and Multimodal LLMs with Out-of-Distribution Strategy"☆22Jun 11, 2025Updated 10 months ago
- ☆45Apr 29, 2025Updated 11 months ago
- Python wrapper of axel, a light command line download accelerator☆10Mar 26, 2017Updated 9 years ago
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 6 months ago
- Code for "Unsupervised Enrichment of Persona-grounded Dialog with Background Stories", ACL 2021☆10Jul 8, 2021Updated 4 years ago
- Automatically modelling and distilling knowledge within AI. In other words, summarising the AI research firehose.☆24Mar 15, 2019Updated 7 years ago
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…☆12Dec 1, 2021Updated 4 years ago
- CRFs based Chinese word segmentor☆21Oct 8, 2014Updated 11 years ago
- AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies☆30Aug 14, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Apr 12, 2024Updated 2 years ago
- Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"☆132Feb 24, 2025Updated last year
- ☆23Apr 5, 2023Updated 3 years ago
- Code for our ACL19 paper on argument generation☆14Nov 9, 2020Updated 5 years ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆21Apr 8, 2025Updated last year
- ☆26Oct 23, 2025Updated 5 months ago
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"☆14Aug 13, 2025Updated 8 months ago
- Shared code for training sentence embeddings with Flax / JAX☆28Jul 15, 2021Updated 4 years ago
- Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs"☆14Oct 10, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- quica is a tool to run inter coder agreement pipelines in an easy and effective ways. Multiple measures are run and results are collected…☆23Nov 9, 2020Updated 5 years ago
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion☆59Oct 1, 2025Updated 6 months ago
- Chat with any codebase with MCP servers in a single command☆13May 28, 2025Updated 10 months ago
- Implementation of "Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions"☆24Aug 27, 2024Updated last year
- Influence Estimation for Gradient-Boosted Decision Trees☆29May 27, 2024Updated last year
- Simple reimplementation of Maximum Density Divergence for Unsupervised Domain Adaptation (https://arxiv.org/abs/2004.12615) in PyTorch Li…☆26Apr 13, 2021Updated 5 years ago
- EMNLP 2020: Personalized Dialog Generation with Commonsense☆18Oct 12, 2022Updated 3 years ago
- Data and Code for ACL 2024 paper "DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Docu…☆23Dec 21, 2024Updated last year
- ICLR 2022☆18Apr 15, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- NeurIPS'24 - LLM Safety Landscape☆39Oct 21, 2025Updated 5 months ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆26May 15, 2025Updated 11 months ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated last week
- ☆22Dec 8, 2022Updated 3 years ago
- Restore safety in fine-tuned language models through task arithmetic☆32Mar 28, 2024Updated 2 years ago
- Topic Detection and Tracking☆19Apr 21, 2015Updated 10 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago