KindXiaoming / BIMT
Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.
☆170 Updated 2 years ago
Alternatives and similar repositories for BIMT
Users who are interested in BIMT are comparing it to the libraries listed below.
- Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆78 Updated 2 years ago
- Omnigrok: Grokking Beyond Algorithmic Data ☆58 Updated 2 years ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆86 Updated 2 months ago
- The boundary of neural network trainability is fractal ☆204 Updated last year
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally … ☆72 Updated 2 years ago
- Neural Networks and the Chomsky Hierarchy ☆205 Updated last year
- Learning Universal Predictors ☆74 Updated 10 months ago
- Tools for studying developmental interpretability in neural networks. ☆91 Updated 4 months ago
- Scalable and Stable Parallelization of Nonlinear RNNs ☆15 Updated 4 months ago
- Automatic gradient descent ☆207 Updated last year
- ☆174 Updated last year
- A Mechanistic Interpretability Analysis of Grokking ☆21 Updated 2 years ago
- ☆51 Updated last year
- Materials for ConceptARC paper ☆94 Updated 7 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆65 Updated 9 months ago
- 🧠 Starter templates for doing interpretability research ☆69 Updated last year
- Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper. ☆36 Updated last year
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025) ☆194 Updated 3 weeks ago
- ☆36 Updated 5 months ago
- Implementation of PSGD optimizer in JAX ☆33 Updated 5 months ago
- ☆185 Updated 6 months ago
- ☆54 Updated last year
- ☆154 Updated last month
- Resources from the EleutherAI Math Reading Group ☆53 Updated 3 months ago
- Hierarchical Associative Memory User Experience ☆100 Updated last year
- The history files when recording human interaction while solving ARC tasks ☆110 Updated this week
- Bootstrapping ARC ☆125 Updated 6 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆128 Updated 3 weeks ago
- The Energy Transformer block, in JAX ☆57 Updated last year
- Running Jax in PyTorch Lightning ☆102 Updated 5 months ago