KindXiaoming / BIMT
Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.
☆166Updated last year
Alternatives and similar repositories for BIMT:
Users that are interested in BIMT are comparing it to the libraries listed below
- ☆169Updated 3 months ago
- Omnigrok: Grokking Beyond Algorithmic Data☆53Updated 2 years ago
- ☆52Updated 5 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆72Updated last week
- ☆209Updated 3 weeks ago
- A State-Space Model with Rational Transfer Function Representation.☆78Updated 10 months ago
- The boundary of neural network trainability is fractal☆195Updated last year
- The Energy Transformer block, in JAX☆56Updated last year
- A MAD laboratory to improve AI architecture designs 🧪☆108Updated 3 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆67Updated last month
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆70Updated last year
- ☆79Updated 11 months ago
- 🧱 Modula software package☆173Updated 2 weeks ago
- Flow-matching algorithms in JAX☆86Updated 7 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- A simple implimentation of Bayesian Flow Networks (BFN)☆240Updated last year
- ☆49Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆62Updated 7 months ago
- Graph neural networks in JAX.☆67Updated 9 months ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆76Updated 2 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆170Updated 3 months ago
- Implementation of PSGD optimizer in JAX☆29Updated 2 months ago
- Learning Universal Predictors☆76Updated 7 months ago
- Automatic gradient descent☆207Updated last year
- Understand and test language model architectures on synthetic tasks.☆185Updated 2 weeks ago
- Resources from the EleutherAI Math Reading Group☆53Updated 3 weeks ago
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆77Updated last year
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆114Updated 5 months ago
- TopoBench is a Python library designed to standardize benchmarking and accelerate research in Topological Deep Learning☆107Updated this week
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆82Updated 2 years ago