HanClinto / MENTAT
☆8 · Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for MENTAT
- ☆100 · Updated 3 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆84 · Updated 3 months ago
- Experiments for efforts to train a new and improved t5 ☆76 · Updated 6 months ago
- Understanding how features learned by neural networks evolve throughout training ☆31 · Updated 3 weeks ago
- Multi-Domain Expert Learning ☆67 · Updated 9 months ago
- ☆76 · Updated 6 months ago
- Sparse and discrete interpretability tool for neural networks ☆54 · Updated 9 months ago
- ☆22 · Updated last year
- Simplex Random Feature attention, in PyTorch ☆71 · Updated last year
- ☆40 · Updated last week
- Tools to make language models a bit easier to use ☆30 · Updated 2 weeks ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆39 · Updated 10 months ago
- A Collection of Pydantic Models to Abstract IRL ☆15 · Updated this week
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec… ☆16 · Updated 8 months ago
- ☆44 · Updated last month
- ☆36 · Updated 3 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆105 · Updated 2 weeks ago
- ☆39 · Updated 9 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆22 · Updated last month
- gzip Predicts Data-dependent Scaling Laws ☆32 · Updated 5 months ago
- code for training & evaluating Contextual Document Embedding models ☆93 · Updated this week
- ☆49 · Updated 6 months ago
- Functional Benchmarks and the Reasoning Gap ☆78 · Updated last month
- ☆43 · Updated 3 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆13 · Updated 2 weeks ago
- ☆55 · Updated 11 months ago
- ☆26 · Updated 4 months ago
- ☆31 · Updated 9 months ago
- ☆40 · Updated 3 weeks ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs. ☆23 · Updated 3 weeks ago