adamkarvonen / SAEBenchLinks
☆97Updated last month
Alternatives and similar repositories for SAEBench
Users that are interested in SAEBench are comparing it to the libraries listed below
Sorting:
- ☆43Updated 6 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆200Updated 5 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆55Updated 7 months ago
- ☆223Updated 8 months ago
- Using sparse coding to find distributed representations used by neural networks.☆247Updated last year
- A library for efficient patching and automatic circuit discovery.☆65Updated last month
- ☆170Updated last month
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆180Updated this week
- ☆93Updated 3 months ago
- ☆120Updated 6 months ago
- ☆121Updated last year
- ☆302Updated 2 weeks ago
- Steering Llama 2 with Contrastive Activation Addition☆154Updated last year