amudide / switch_sae
Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)
☆23 Updated 6 months ago
Alternatives and similar repositories for switch_sae
Users who are interested in switch_sae are comparing it to the libraries listed below.
- ☆32 Updated 5 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆42 Updated this week
- Efficient Scaling laws and collaborative pretraining. ☆16 Updated 4 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆24 Updated 3 months ago
- ☆65 Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders" ☆22 Updated 4 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆32 Updated 3 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear" ☆75 Updated 6 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆10 Updated 2 months ago
- ☆23 Updated 8 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated last year
- Understanding the correlation between different LLM benchmarks ☆29 Updated last year
- This repo is based on https://github.com/jiaweizzhao/GaLore ☆28 Updated 9 months ago
- Implementation of dualformer ☆17 Updated 3 months ago
- ☆22 Updated 8 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508) ☆35 Updated this week
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆95 Updated 3 weeks ago
- Exploration of automated dataset selection approaches at large scales. ☆45 Updated 3 months ago
- A mechanistic approach for understanding and detecting factual errors of large language models. ☆46 Updated 11 months ago
- Sparse and discrete interpretability tool for neural networks ☆63 Updated last year
- Using FlexAttention to compute attention with different masking patterns ☆44 Updated 9 months ago
- ☆18 Updated 2 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆27 Updated last year
- ☆14 Updated last year
- Lottery Ticket Adaptation ☆39 Updated 7 months ago
- ☆23 Updated 4 months ago
- ☆53 Updated last week
- ☆32 Updated last year
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) ☆19 Updated 4 months ago
- The repository contains code for Adaptive Data Optimization ☆25 Updated 6 months ago