google-deepmind / mishax
☆124Updated this week
Alternatives and similar repositories for mishax:
Users that are interested in mishax are comparing it to the libraries listed below
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆161Updated this week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆185Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆100Updated 4 months ago
- Functional Benchmarks and the Reasoning Gap☆84Updated 5 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆186Updated 9 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆70Updated 3 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆45Updated 4 months ago
- ☆32Updated 3 weeks ago
- A MAD laboratory to improve AI architecture designs 🧪☆107Updated 3 months ago
- ☆61Updated 4 months ago
- PyTorch library for Active Fine-Tuning☆60Updated last month
- ☆26Updated 11 months ago
- ☆89Updated last month
- A toolkit for describing model features and intervening on those features to steer behavior.☆162Updated 4 months ago
- ☆62Updated last month
- Experiments for efforts to train a new and improved t5☆77Updated 11 months ago
- Language models scale reliably with over-training and on downstream tasks☆96Updated 11 months ago
- ☆121Updated last year
- code for training & evaluating Contextual Document Embedding models☆176Updated 2 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆122Updated 11 months ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated 9 months ago
- ☆110Updated 3 weeks ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆58Updated 4 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆116Updated 2 years ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆90Updated 3 weeks ago
- ☆80Updated 2 months ago
- 🧠 Starter templates for doing interpretability research☆67Updated last year
- some common Huggingface transformers in maximal update parametrization (µP)☆80Updated 3 years ago
- An introduction to LLM Sampling☆77Updated 3 months ago