KindXiaoming / BIMT
Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.
☆163Updated last year
Alternatives and similar repositories for BIMT:
Users that are interested in BIMT are comparing it to the libraries listed below
- Automatic gradient descent☆206Updated last year
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆69Updated last year
- The boundary of neural network trainability is fractal☆190Updated 11 months ago
- Omnigrok: Grokking Beyond Algorithmic Data☆51Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆67Updated 2 years ago
- Graph neural networks in JAX.☆67Updated 7 months ago
- Learning Universal Predictors☆71Updated 5 months ago
- A MAD laboratory to improve AI architecture designs 🧪☆102Updated last month
- ☆146Updated last month
- Cellular Automata Accelerated in JAX☆77Updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆42Updated last month
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆182Updated 7 months ago
- 🧠 Starter templates for doing interpretability research☆64Updated last year
- ☆201Updated 6 months ago
- σ-GPT: A New Approach to Autoregressive Models☆61Updated 5 months ago
- Implementation of PSGD optimizer in JAX☆26Updated 2 weeks ago
- Uncertainty quantification with PyTorch☆333Updated 2 months ago
- ☆37Updated 2 years ago
- Easy Hypernetworks in Pytorch and Jax☆96Updated last year
- Flow-matching algorithms in JAX☆82Updated 5 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆116Updated 2 months ago
- SR based on LLMs.☆92Updated 2 years ago
- Hierarchical Associative Memory User Experience☆94Updated last year
- A Mechanistic Interpretability Analysis of Grokking☆19Updated 2 years ago
- Neural Networks and the Chomsky Hierarchy☆192Updated 9 months ago
- A State-Space Model with Rational Transfer Function Representation.☆76Updated 8 months ago
- Reverse Engineering the Abstraction and Reasoning Corpus☆216Updated 3 months ago
- The Energy Transformer block, in JAX☆54Updated last year
- ☆48Updated 11 months ago
- My writings about ARC (Abstraction and Reasoning Corpus)☆64Updated last week