matchten/LoRA-Models-for-SAEs

matchten / LoRA-Models-for-SAEs

Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"

☆17

Alternatives and similar repositories for LoRA-Models-for-SAEs

Users that are interested in LoRA-Models-for-SAEs are comparing it to the libraries listed below.

Butanium / tiny-activation-dashboard
View on GitHub
A tiny easily hackable implementation of a feature dashboard.
☆17 • Oct 21, 2025 • Updated 9 months ago
KempnerInstitute / llm_uncertainty
View on GitHub
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆11 • Jul 18, 2026 • Updated last week
ThirdAIResearch / Dessert
View on GitHub
DESSERT Efficiently Searches Sets of Embeddings via Retrieval Tables
☆18 • Feb 21, 2024 • Updated 2 years ago
JoshEngels / FLINNG
View on GitHub
A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing
☆23 • Dec 9, 2023 • Updated 2 years ago
zepingyu0512 / arithmetic-mechanism
View on GitHub
code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
☆12 • Nov 17, 2024 • Updated last year
delphi-suite / delphi
View on GitHub
small language models training made easy
☆15 • Dec 15, 2024 • Updated last year
JoshEngels / SAE-Dark-Matter
View on GitHub
Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"
☆23 • Feb 6, 2025 • Updated last year
allenai / beacon
View on GitHub
On-the-fly Definition Augmentation of LLMs for Biomedical NER
☆14 • Apr 14, 2025 • Updated last year
Asap7772 / fewshot-preference-optimization
View on GitHub
Few-Shot Preference Optimization (FSPO) personalizes LLMs by reframing reward modeling as a meta-learning problem, enabling rapid adaptat…
☆16 • Feb 27, 2025 • Updated last year
alexrutar / banditvis
View on GitHub
A Python 3 Bandit Visualization Package
☆11 • Oct 16, 2017 • Updated 8 years ago
science-of-finetuning / crosscoder_learning
View on GitHub
Modified to support crosscoder training.
☆27 • Jul 2, 2026 • Updated 3 weeks ago
MingLiiii / Gradient_Unified
View on GitHub
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
☆20 • Jun 17, 2025 • Updated last year
slavachalnev / SAE-TS
View on GitHub
Improving Steering Vectors by Targeting Sparse Autoencoder Features
☆29 • Nov 20, 2024 • Updated last year
TaiMingLu / know-dont-tell
View on GitHub
☆19 • Oct 14, 2024 • Updated last year
joeljang / FLM
View on GitHub
All-in-one repository for Fine-tuning & Pretraining (Large) Language Models
☆15 • Mar 8, 2023 • Updated 3 years ago
technion-cs-nlp / parametric-faithfulness
View on GitHub
☆23 • Aug 30, 2025 • Updated 10 months ago
zsLin177 / camr
View on GitHub
The system of SUDA-HUAWEI submitted at CAMR2022.
☆12 • Nov 22, 2022 • Updated 3 years ago
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18 • Mar 15, 2024 • Updated 2 years ago
THU-KEG / PairJudgeRM
View on GitHub
☆15 • Apr 14, 2025 • Updated last year
lvwerra / deep-math
View on GitHub
Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"
☆30 • Mar 25, 2023 • Updated 3 years ago
Ksuriuri / LLMCI
View on GitHub
Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers
☆14 • Jun 7, 2024 • Updated 2 years ago
zepingyu0512 / in-context-mechanism
View on GitHub
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆13 • Nov 17, 2024 • Updated last year
mt-upc / logit-explanations
View on GitHub
☆18 • Jun 19, 2023 • Updated 3 years ago
KihoPark / LLM_Categorical_Hierarchical_Representations
View on GitHub
Code for 'The Geometry of Categorical and Hierarchical Concepts in Large Language Models' (ICLR 2025, Oral)
☆115 • Feb 11, 2025 • Updated last year
CoopReason / TESSY
View on GitHub
A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data
☆34 • May 1, 2026 • Updated 2 months ago
forwchen / LLaVA-MoLE
View on GitHub
☆10 • Mar 4, 2024 • Updated 2 years ago
saprmarks / feature-circuits
View on GitHub
☆223 • Oct 14, 2025 • Updated 9 months ago
yafuly / CoGnition
View on GitHub
☆17 • Nov 10, 2021 • Updated 4 years ago
hiendt58 / machine-learning
View on GitHub
☆13 • Apr 17, 2018 • Updated 8 years ago
rosieyzh / openrlhf-pretrain
View on GitHub
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆29 • Oct 14, 2025 • Updated 9 months ago
Trustworthy-ML-Lab / Describe-and-Dissect
View on GitHub
[TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models
☆11 • Feb 20, 2025 • Updated last year
SethEBaldwin / mdscuda
View on GitHub
CUDA implementation of Multidimensional Scaling
☆15 • May 8, 2021 • Updated 5 years ago
zeicold / pycnnum
View on GitHub
Converting Chinese number string <=> int/float/str
☆20 • Apr 29, 2025 • Updated last year
alexgaskell10 / nlp_summarization
View on GitHub
Contains the code for my Imperial College London Master's thesis on text summarization
☆11 • Oct 25, 2022 • Updated 3 years ago
jacky121298 / WLST
View on GitHub
[ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection
☆12 • Feb 6, 2024 • Updated 2 years ago
cmmp / pyproclus
View on GitHub
A python implementation of PROCLUS: PROjected CLUStering algorithm.
☆10 • Jan 12, 2015 • Updated 11 years ago
jugechengzi / Rationalization-MGR
View on GitHub
ACL 2023 *oral* paper "MGR: Multi-generator based Rationalization"
☆10 • Nov 21, 2024 • Updated last year
phquang / Continual-Normalization
View on GitHub
☆14 • Sep 7, 2022 • Updated 3 years ago
vedantpalit / Towards-Vision-Language-Mechanistic-Interpretability
View on GitHub
This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…
☆25 • Feb 16, 2026 • Updated 5 months ago