A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆98Apr 13, 2025Updated last year
Alternatives and similar repositories for get-haized
Users that are interested in get-haized are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16May 30, 2024Updated 2 years ago
- Red-Teaming Language Models with DSPy☆259Feb 13, 2025Updated last year
- ☆87Dec 22, 2023Updated 2 years ago
- Substrate TypeScript SDK☆10Sep 20, 2024Updated last year
- ☆50Aug 3, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Aug 3, 2024Updated last year
- A trivial programmatic Llama 3 jailbreak. Sorry Zuck!☆570Jan 26, 2025Updated last year
- Sphynx Hallucination Induction☆54Jan 31, 2025Updated last year
- ☆26Sep 3, 2025Updated 9 months ago
- Inference-time scaling for LLMs-as-a-judge.☆343Nov 5, 2025Updated 7 months ago
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated last year
- Repository for "Training Language Models To Explain Their Own Computations"☆22Dec 22, 2025Updated 5 months ago
- ☆15Jun 7, 2024Updated 2 years ago
- Prompt engineering, automated.☆353Apr 22, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆18Sep 15, 2023Updated 2 years ago
- A prompt defence is a multi-layer defence that can be used to protect your applications against prompt injection attacks.☆22Apr 8, 2026Updated 2 months ago
- [CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak…☆3,716Dec 24, 2024Updated last year
- A text-based game where language models learn to lie and to detect lies.☆12Oct 4, 2023Updated 2 years ago
- ☆27Oct 6, 2024Updated last year
- Contrastive Chain-of-Thought Prompting☆69Nov 18, 2023Updated 2 years ago
- Fluent student-teacher redteaming☆23Jul 25, 2024Updated last year
- Grok by X (Twitter) System Prompt Leak☆64Dec 9, 2023Updated 2 years ago
- This is the code for the paper "Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation".☆38Apr 15, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, s…☆573Jan 11, 2025Updated last year
- A command line tool for crawling a webstite for dead links, permeant and or fatal redirects, resource load issues, and script errors. It…☆12Apr 16, 2023Updated 3 years ago
- ☆29Oct 22, 2024Updated last year
- ☆23May 1, 2026Updated last month
- ☆16Sep 12, 2024Updated last year
- ☆69May 23, 2025Updated last year
- Prompting and research workflow for lecture building☆16Sep 5, 2025Updated 9 months ago
- Baidu Qianfan Deep Research☆35Jun 8, 2026Updated last week
- Demo of AI chatbot that predicts user message to generate response quickly.☆106Feb 28, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Meme search engine for the real shitposters☆11Jan 27, 2026Updated 4 months ago
- A simple evaluation of generative language models and safety classifiers.☆100Updated this week
- Distributed parallel computing made easy.☆28May 26, 2023Updated 3 years ago
- PyTorch implementation of the NSGT/sliCQT☆17Nov 10, 2023Updated 2 years ago
- Payloads for Attacking Large Language Models☆138Jan 13, 2026Updated 5 months ago
- Open-sourced evaluation suite from the Monitoring Monitorability paper☆81Jun 11, 2026Updated last week
- A better way of testing, inspecting, and analyzing AI Agent traces.☆56Jan 12, 2026Updated 5 months ago