haizelabs / thorn-in-haizestack
Thorn in a HaizeStack test for evaluating long-context adversarial robustness.
☆26Updated last month
Related projects:
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆44Updated 3 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆30Updated last month
- Small, simple agent task environments for training and evaluation☆13Updated last week
- Functional Benchmarks and the Reasoning Gap☆74Updated last month
- Utilities for loading and running text embeddings with ONNX☆39Updated last month
- Repository for the paper Stream of Search: Learning to Search in Language☆70Updated last month
- Red-Teaming Language Models with DSPy☆116Updated 5 months ago
- Sphynx Hallucination Induction☆44Updated last month
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆23Updated 10 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 4 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆40Updated 6 months ago
- Experiments for efforts to train a new and improved T5☆76Updated 5 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆73Updated 6 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆49Updated 3 weeks ago
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆60Updated last year
- Flow of Reasoning: Efficient Training of LLM Policy with Diverse Thinking☆25Updated this week
- Attribute (or cite) statements generated by LLMs back to in-context information.☆107Updated 2 weeks ago
- Using modal.com to process FineWeb-edu data☆18Updated last week
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆36Updated 3 weeks ago
- Steering vectors for transformer language models in PyTorch / Hugging Face☆52Updated last month
- Synthetic data derived by templating, few-shot prompting, transformations on public-domain corpora, and Monte Carlo tree search.☆18Updated 2 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆77Updated 3 months ago