JacobPfau / procgenAISCLinks
☆19Updated 2 years ago
Alternatives and similar repositories for procgenAISC
Users that are interested in procgenAISC are comparing it to the libraries listed below
Sorting:
- Redwood Research's transformer interpretability tools☆14Updated 3 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated last year
- Interpreting how transformers simulate agents performing RL tasks☆87Updated last year
- Mechanistic Interpretability for Transformer Models☆51Updated 3 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆78Updated 3 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆127Updated 2 years ago
- Sparse Autoencoder Training Library☆54Updated 3 months ago