amudide / switch_sae
Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)
☆20Updated 2 months ago
Alternatives and similar repositories for switch_sae:
Users that are interested in switch_sae are comparing it to the libraries listed below
- ☆26Updated last month
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆19Updated last week
- Using FlexAttention to compute attention with different masking patterns☆40Updated 4 months ago
- The repository contains code for Adaptive Data Optimization☆20Updated 2 months ago
- Minimum Description Length probing for neural network representations☆18Updated 2 weeks ago
- Lottery Ticket Adaptation☆37Updated 2 months ago
- ☆58Updated 9 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆18Updated last month
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆67Updated 2 months ago
- Evaluation of neuro-symbolic engines☆34Updated 6 months ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated 8 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆18Updated last month
- ☆21Updated 4 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆36Updated 4 months ago
- A mechanistic approach for understanding and detecting factual errors of large language models.☆40Updated 7 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- Sparse and discrete interpretability tool for neural networks☆58Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆36Updated last year
- Efficient Scaling laws and collaborative pretraining.☆14Updated 2 weeks ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated 8 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆25Updated 9 months ago
- Aioli: A unified optimization framework for language model data mixing☆20Updated 3 weeks ago
- ☆71Updated 5 months ago
- ☆12Updated 11 months ago
- ☆26Updated 7 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆39Updated 2 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated 11 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆14Updated this week