google-deepmind / mishax
☆117 Updated last week
Alternatives and similar repositories for mishax:
Users who are interested in mishax are comparing it to the libraries listed below.
- ☆139 Updated this week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆177 Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆91 Updated 2 months ago
- Functional Benchmarks and the Reasoning Gap ☆82 Updated 3 months ago
- ☆54 Updated 2 months ago
- Steering vectors for transformer language models in PyTorch / Hugging Face ☆81 Updated 2 months ago
- ☆25 Updated 9 months ago
- ☆45 Updated this week
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆53 Updated 2 months ago
- Erasing concepts from neural representations with provable guarantees ☆221 Updated this week
- Code for reproducing our paper "Not All Language Model Features Are Linear" ☆66 Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆158 Updated 2 weeks ago
- Code for training & evaluating Contextual Document Embedding models ☆166 Updated 2 weeks ago
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆31 Updated 3 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆75 Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆156 Updated 3 months ago
- Extract full next-token probabilities via language model APIs ☆229 Updated 11 months ago
- ☆25 Updated this week
- Sparse autoencoders ☆414 Updated last week
- ☆24 Updated 2 months ago
- Experiments for efforts to train a new and improved T5 ☆77 Updated 9 months ago
- ☆80 Updated 3 weeks ago
- ☆54 Updated 3 weeks ago
- Mechanistic Interpretability Visualizations using React ☆223 Updated last month
- Automatic Evals for LLMs ☆128 Updated this week
- ☆49 Updated 4 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al. (NeurIPS 2024) ☆183 Updated 8 months ago
- ☆220 Updated last week
- ☆116 Updated last year