for-ai / llm-profiling-toolkit
☆10Updated 2 months ago
Related projects: ⓘ
- Experiments for efforts to train a new and improved t5☆76Updated 5 months ago
- Sparse autoencoders for Contra text embedding models☆24Updated 4 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated last month
- ☆91Updated last month
- utilities for loading and running text embeddings with onnx☆39Updated last month
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆62Updated last year
- ☆34Updated 3 weeks ago
- ☆48Updated 11 months ago
- ☆15Updated 3 months ago
- Code for Zero-Shot Tokenizer Transfer☆109Updated 2 months ago
- ☆121Updated last month
- Red-Teaming Language Models with DSPy☆116Updated 5 months ago
- A library for squeakily cleaning and filtering language datasets.☆45Updated last year
- Small, simple agent task environments for training and evaluation☆13Updated last week
- ☆24Updated 5 months ago
- ☆29Updated 2 weeks ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆81Updated 6 months ago
- ☆43Updated this week
- ☆38Updated this week
- Code repository for the c-BTM paper☆105Updated 11 months ago
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- Functional Benchmarks and the Reasoning Gap☆74Updated last month
- ☆68Updated 2 months ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆73Updated last month
- ☆44Updated 2 months ago
- direct preference optimization with only 1 model copy :)☆12Updated 11 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 4 months ago
- ☆54Updated last week
- Understanding how features learned by neural networks evolve throughout training☆30Updated this week
- ☆13Updated this week