scienceetonnante / grokking
☆54Updated last year
Related projects ⓘ
Alternatives and complementary repositories for grokking
- Backend ressources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents qu…☆116Updated this week
- The boundary of neural network trainability is fractal☆161Updated 9 months ago
- Répertoire des traductions françaises des notebooks d'Alfredo Canziani du cours Deep Learning de la NYU☆31Updated last year
- Tools for understanding how transformer predictions are built layer-by-layer☆433Updated 5 months ago
- Mechanistic Interpretability Visualizations using React☆200Updated 4 months ago
- ☆146Updated last month
- Using sparse coding to find distributed representations used by neural networks.☆189Updated last year
- Repository containing the code for training the CroissantLLM☆21Updated 9 months ago
- Exploration of the Lenia continuous cellular automaton☆75Updated 10 months ago
- The nnsight package enables interpreting and manipulating the internals of deep learned models.☆407Updated this week
- ☆139Updated 3 months ago
- Training Sparse Autoencoders on Language Models☆487Updated this week
- ☆109Updated this week
- Sparse autoencoders☆347Updated this week
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆485Updated last week
- Free and open source code of the https://tournesol.app platform. Meet the community on Discord https://discord.gg/WvcSG55Bf3☆330Updated this week
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆516Updated 4 months ago
- LENS Project☆42Updated 9 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- Build and train Lipschitz-constrained networks: PyTorch implementation of 1-Lipschitz layers. For TensorFlow/Keras implementation, see ht…☆27Updated last week
- Sparse Autoencoder for Mechanistic Interpretability☆191Updated 4 months ago
- Tools for studying developmental interpretability in neural networks.☆77Updated last week
- Editing Models with Task Arithmetic☆431Updated 10 months ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆64Updated 2 years ago
- Benchmarks for the Evaluation of LLM Supervision☆28Updated this week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆161Updated last month
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead☆125Updated this week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆181Updated 5 months ago
- 🧠 Starter templates for doing interpretability research☆63Updated last year
- nanoGPT-like codebase for LLM training☆75Updated this week