amudide / switch_saeLinks

Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)

☆25

Alternatives and similar repositories for switch_sae

Users that are interested in switch_sae are comparing it to the libraries listed below

Sorting:

tyler-romero / microR1
Simple repository for training small reasoning models
☆33Updated 5 months ago
katiekang1998 / reasoning_generalization
☆33Updated 6 months ago
JoshEngels / SAE-Dark-Matter
Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"
☆22Updated 5 months ago
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆46Updated 3 months ago
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63Updated last year
kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆39Updated 7 months ago
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]
☆19Updated last month
shreyansh26 / Attention-Mask-Patterns
Using FlexAttention to compute attention with different masking patterns
☆44Updated 9 months ago
likenneth / q_probe
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
☆41Updated last year
samuelarnesen / nyu-debate-modeling
☆22Updated 9 months ago
data-for-agents / insta
Official Repo for InSTA: Towards Internet-Scale Training For Agents
☆48Updated last week
ethz-spylab / superhuman-ai-consistency
☆29Updated 2 years ago
IBM / ColPret
Efficient Scaling laws and collaborative pretraining.
☆16Updated 5 months ago
microsoft / mechanistic-error-probe
A mechanistic approach for understanding and detecting factual errors of large language models.
☆46Updated last year
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆35Updated last year
sher222 / LeReT
Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
☆39Updated 8 months ago
ctlllll / understanding_llm_benchmarks
Understanding the correlation between different LLM benchmarks
☆29Updated last year
codezakh / DataEnvGym
A testbed for agents and environments that can automatically improve models through data generation.
☆24Updated 4 months ago
complex-reasoning / RPG
The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)
☆35Updated last week
yidingjiang / ado
The repository contains code for Adaptive Data Optimization
☆25Updated 7 months ago
upiterbarg / lintseq
[ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)
☆19Updated 5 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆33Updated 3 months ago
facebookresearch / dualformer
implementation of dualformer
☆18Updated 4 months ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27Updated 5 months ago
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆75Updated 8 months ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18Updated 5 months ago
ml-jku / EVA
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆40Updated 9 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆86Updated last year
ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆90Updated 8 months ago
google-deepmind / alta
☆25Updated 9 months ago