☆155 Feb 16, 2026 Updated 2 weeks ago
Alternatives and similar repositories for mishax
Users that are interested in mishax are comparing it to the libraries listed below
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆247 Updated this week
- Sparsify transformers with SAEs and transcoders ☆696 Feb 23, 2026 Updated last week
- Training Sparse Autoencoders on Language Models ☆1,219 Feb 23, 2026 Updated last week
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆243 Feb 23, 2026 Updated last week
- The nnsight package enables interpreting and manipulating the internals of deep learned models. ☆825 Feb 23, 2026 Updated last week
- Code for "What really matters in matrix-whitening optimizers?" ☆21 Oct 31, 2025 Updated 4 months ago
- ☆134 Oct 28, 2023 Updated 2 years ago
- A library for mechanistic interpretability of GPT-style language models ☆3,112 Feb 23, 2026 Updated last week
- ☆89 Dec 18, 2025 Updated 2 months ago
- Modified to support crosscoder training. ☆25 Feb 4, 2026 Updated 3 weeks ago
- Sparse Autoencoder Training Library ☆55 May 1, 2025 Updated 10 months ago
- ☆209 Oct 14, 2025 Updated 4 months ago
- ☆36 Apr 30, 2024 Updated last year
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban ☆17 Jun 29, 2025 Updated 8 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆64 Oct 27, 2024 Updated last year
- A TinyStories LM with SAEs and transcoders ☆14 Apr 3, 2025 Updated 11 months ago
- ☆23 Jun 30, 2025 Updated 8 months ago
- Training hybrid models for dummies. ☆29 Nov 1, 2025 Updated 4 months ago
- Mechanistic Interpretability Visualizations using React ☆326 Dec 18, 2024 Updated last year
- ☆199 Nov 17, 2024 Updated last year
- A library for making RepE control vectors ☆689 Sep 24, 2025 Updated 5 months ago
- ☆396 Aug 21, 2025 Updated 6 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning" ☆21 Oct 28, 2024 Updated last year
- Repository of IPBench ☆19 Jan 4, 2026 Updated last month
- Steering Llama 2 with Contrastive Activation Addition ☆212 May 23, 2024 Updated last year
- Training code for Sparse Autoencoders on Embedding models ☆39 Feb 27, 2025 Updated last year
- ☆19 Jan 21, 2023 Updated 3 years ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones ☆64 Jan 26, 2026 Updated last month
- Applying SAEs for fine-grained control ☆25 Dec 15, 2024 Updated last year
- ☆571 Jul 19, 2024 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆32 Jan 17, 2025 Updated last year
- ☆153 Dec 30, 2025 Updated 2 months ago
- Stanford NLP Python library for understanding and improving PyTorch models via interventions ☆863 Jan 29, 2026 Updated last month
- Representation Engineering: A Top-Down Approach to AI Transparency ☆953 Aug 14, 2024 Updated last year
- ☆36 Jul 4, 2025 Updated 7 months ago
- ☆25 Sep 5, 2024 Updated last year
- ☆44 Nov 16, 2021 Updated 4 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ☆12 Feb 11, 2024 Updated 2 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba ☆35 Oct 16, 2025 Updated 4 months ago