KindXiaoming / BIMT
Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.
☆170 · Updated last year
Alternatives and similar repositories for BIMT
Users who are interested in BIMT are comparing it to the libraries listed below.
- ☆181 · Updated 5 months ago
- The boundary of neural network trainability is fractal ☆199 · Updated last year
- 🧱 Modula software package ☆191 · Updated last month
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆78 · Updated 2 years ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆122 · Updated last month
- Training small GPT-2 style models using Kolmogorov-Arnold networks. ☆117 · Updated 11 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆82 · Updated 2 months ago
- Omnigrok: Grokking Beyond Algorithmic Data ☆56 · Updated 2 years ago
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025) ☆183 · Updated this week
- Learning Universal Predictors ☆73 · Updated 9 months ago
- The Energy Transformer block, in JAX ☆57 · Updated last year
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally … ☆71 · Updated last year
- A MAD laboratory to improve AI architecture designs 🧪 ☆115 · Updated 5 months ago
- A State-Space Model with Rational Transfer Function Representation. ☆78 · Updated last year
- Bare-bones implementations of some generative models in Jax: diffusion, normalizing flows, consistency models, flow matching, (beta)-VAEs… ☆129 · Updated last year
- Tools for working with the Abstraction & Reasoning Corpus ☆187 · Updated 9 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆64 · Updated 9 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆100 · Updated 4 months ago
- ☆49 · Updated last year
- ☆94 · Updated 3 months ago
- Hierarchical Associative Memory User Experience ☆101 · Updated last year
- ☆147 · Updated last month
- ☆53 · Updated last year
- Automatic gradient descent ☆207 · Updated last year
- Flow-matching algorithms in JAX ☆90 · Updated 9 months ago
- ☆291 · Updated 4 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆83 · Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆120 · Updated last week
- Codes for the paper "A mathematical perspective on Transformers". ☆36 · Updated 10 months ago
- A package for defining deep learning models using categorical algebraic expressions. ☆60 · Updated 9 months ago