Thorn in a HaizeStack test for evaluating long-context adversarial robustness.
☆26 Aug 3, 2024 Updated last year
Alternatives and similar repositories for thorn-in-haizestack
Users that are interested in thorn-in-haizestack are comparing it to the libraries listed below.
- Sphynx Hallucination Induction ☆54 Jan 31, 2025 Updated last year
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar… ☆14 Mar 25, 2026 Updated last month
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability. ☆18 Jan 14, 2025 Updated last year
- Red-Teaming Language Models with DSPy ☆256 Feb 13, 2025 Updated last year
- Fork of Flame repo for training of some new stuff in development ☆19 Apr 24, 2026 Updated 2 weeks ago
- [Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances ☆20 Jan 15, 2026 Updated 3 months ago
- Self-Supervised Alignment with Mutual Information ☆20 May 24, 2024 Updated last year
- ☆19 Dec 4, 2025 Updated 5 months ago
- Applying SAEs for fine-grained control ☆26 Dec 15, 2024 Updated last year
- Debian packaging for NNCP [archived], moved to https://salsa.debian.org/go-team/packages/nncp ☆14 Feb 18, 2023 Updated 3 years ago
- Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models ☆36 Jun 1, 2025 Updated 11 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆99 Apr 13, 2025 Updated last year
- beep boop personal website hosted at tinabmai.com ☆17 Updated this week
- Open-source Human Feedback Library ☆11 Oct 25, 2023 Updated 2 years ago
- Modified to support crosscoder training. ☆27 Feb 4, 2026 Updated 3 months ago
- ☆13 Feb 12, 2023 Updated 3 years ago
- Access the Cohere Command R family of models ☆39 Mar 28, 2025 Updated last year
- Distillation of Ensemble Dependency Parsers into a Single Graph-Based Parser ☆11 Oct 14, 2016 Updated 9 years ago
- Tools to simplify life with AI ☆30 Apr 4, 2025 Updated last year
- Instant Neural Graphics Primitives from scratch, zero dependencies. Learning by doing. ☆10 Aug 18, 2023 Updated 2 years ago
- Limits asset outflows from contracts within customisable timeframes ☆11 May 7, 2022 Updated 4 years ago
- ☆12 Apr 19, 2024 Updated 2 years ago
- An API that allows you to scrape blog posts and articles and get a list of notes or a summary back. ☆10 Mar 31, 2023 Updated 3 years ago
- Pipeline for the production of digital scholarly editions of archival collections ☆14 Feb 22, 2024 Updated 2 years ago
- Building self-refined guardrails via DSPy ☆14 Jul 2, 2024 Updated last year
- mHC kernels implemented in CUDA ☆265 Mar 9, 2026 Updated 2 months ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations" ☆13 Dec 14, 2021 Updated 4 years ago
- ☆29 Apr 30, 2024 Updated 2 years ago
- Newspaper Segmentation into images and text ☆12 Jan 11, 2019 Updated 7 years ago
- Codebase for Mechanistic Mode Connectivity ☆13 Jul 14, 2023 Updated 2 years ago
- A search engine for Lean 4 declarations ☆64 Updated this week
- ☆14 Apr 27, 2022 Updated 4 years ago
- repo for code for paper on general theory associative memory models ☆22 Jun 15, 2022 Updated 3 years ago
- ☆15 Jul 9, 2025 Updated 10 months ago
- Global Greedy Dependency Parsing ☆10 Mar 16, 2021 Updated 5 years ago
- Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography. ☆25 Jan 26, 2024 Updated 2 years ago
- How to create a real-time UI with NextJS and Supabase ☆12 Jun 14, 2024 Updated last year
- Clustered Compositional Embeddings ☆13 Oct 25, 2023 Updated 2 years ago
- Fluent dreaming for language models ☆13 Jul 22, 2024 Updated last year