Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)
☆25 Dec 1, 2024 Updated last year
Alternatives and similar repositories for switch_sae
Users that are interested in switch_sae are comparing it to the libraries listed below
- ☆24 Aug 23, 2025 Updated 6 months ago
- Phonic Node.js SDK ☆14 Mar 12, 2026 Updated last week
- ☆15 Jul 9, 2025 Updated 8 months ago
- ☆403 Aug 21, 2025 Updated 7 months ago
- ☆20 Nov 3, 2024 Updated last year
- Trains Sparse Autoencoders based on outputs from language models ☆11 Oct 7, 2024 Updated last year
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆245 Updated this week
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders" ☆23 Feb 6, 2025 Updated last year
- Scheduled device lock ☆12 Feb 26, 2017 Updated 9 years ago
- (SIGIR 25) Repo for "Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation" ☆10 Jan 18, 2025 Updated last year
- ☆19 Aug 10, 2024 Updated last year
- Improving Steering Vectors by Targeting Sparse Autoencoder Features ☆27 Nov 20, 2024 Updated last year
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un… ☆17 Dec 17, 2025 Updated 3 months ago
- Scratchpad/Chain-of-Thought Prompts ☆12 Jun 6, 2022 Updated 3 years ago
- ☆92 Mar 28, 2025 Updated 11 months ago
- Gene regulatory network inference for RNA velocity and pseudotime data ☆29 Nov 6, 2025 Updated 4 months ago
- ☆43 Feb 22, 2026 Updated last month
- ☆157 Dec 30, 2025 Updated 2 months ago
- ☆14 Apr 14, 2025 Updated 11 months ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm. ☆10 Jan 12, 2015 Updated 11 years ago
- Recommendation engine and its algorithms in Python and R. ☆12 Oct 26, 2018 Updated 7 years ago
- Multi-Layer Sparse Autoencoders (ICLR 2025) ☆29 Feb 6, 2026 Updated last month
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion. ☆11 Nov 30, 2018 Updated 7 years ago
- Principal component analysis using a linear autoencoder ☆16 May 25, 2019 Updated 6 years ago
- [NeurIPS'23] Binary Classification with Confidence Difference ☆10 May 13, 2024 Updated last year
- ☆11 Mar 31, 2022 Updated 3 years ago
- Mac port of Torcs, The Open Racing Car Simulator ☆11 Jun 16, 2010 Updated 15 years ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆251 Feb 27, 2026 Updated 3 weeks ago
- ☆15 Mar 30, 2025 Updated 11 months ago
- [ICML 2023] Protecting Language Generation Models via Invisible Watermarking ☆13 Sep 8, 2023 Updated 2 years ago
- A labeled dataset used for the knowledge graph construction. ☆34 Nov 30, 2023 Updated 2 years ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt ☆11 Dec 15, 2025 Updated 3 months ago
- Chest X-Ray Explainer (ChEX) ☆23 Jan 30, 2025 Updated last year
- ☆11 Mar 20, 2023 Updated 3 years ago
- Repository for the NeurIPS 2023 paper "Beyond Confidence: Reliable Models Should Also Consider Atypicality" ☆13 Apr 21, 2024 Updated last year
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs. ☆57 Oct 30, 2025 Updated 4 months ago
- a single interface around speech-to-speech foundation models ☆28 Jun 27, 2025 Updated 8 months ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. ☆10 Jan 10, 2019 Updated 7 years ago
- nyc is so back ☆21 Jun 27, 2025 Updated 8 months ago