redwoodresearch / rust_circuit_publicLinks
☆65Updated 2 years ago
Alternatives and similar repositories for rust_circuit_public
Users that are interested in rust_circuit_public are comparing it to the libraries listed below
Sorting:
- Tools for studying developmental interpretability in neural networks.☆122Updated last week
- ☆132Updated 2 years ago
- ☆262Updated last year
- Mechanistic Interpretability Visualizations using React☆307Updated last year
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Updated 11 months ago
- ☆135Updated last year
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆236Updated last year
- ☆283Updated last year
- ☆29Updated last year
- Extract full next-token probabilities via language model APIs☆248Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆132Updated 3 years ago
- ☆17Updated 3 weeks ago
- (Model-written) LLM evals library