haizelabs / get-haized
A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
☆86Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for get-haized
- Red-Teaming Language Models with DSPy☆142Updated 7 months ago
- Sphynx Hallucination Induction☆48Updated 3 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆22Updated last month
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?☆94Updated this week
- Just a bunch of benchmark logs for different LLMs☆116Updated 3 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- ☆104Updated 8 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆56Updated 3 weeks ago
- ☆14Updated last month
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 3 months ago
- 🤖 Headless IDE for AI agents☆133Updated this week
- A trace analysis tool for AI agents.☆124Updated last month
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆100Updated this week
- An automated tool for discovering insights from research papaer corpora☆135Updated 5 months ago
- A framework for orchestrating AI agents using a mermaid graph☆75Updated 6 months ago
- look how they massacred my boy☆58Updated last month
- Stream of my favorite papers and links☆36Updated 2 months ago
- ☆39Updated 11 months ago
- Turn a Github Repo's contents into a big prompt for long-context models like Claude 3 Opus.☆149Updated 7 months ago
- Routing on Random Forest (RoRF)☆84Updated 2 months ago
- Track the progress of LLM context utilisation☆53Updated 4 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆23Updated last year
- ☆48Updated last year
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆17Updated 5 months ago
- A simple wrapper for OpenAI to log input/outputs.☆103Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆97Updated last year
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated 10 months ago