Sohl-Dickstein / fractal
The boundary of neural network trainability is fractal
☆210 Updated last year
Alternatives and similar repositories for fractal
Users who are interested in fractal are comparing it to the repositories listed below
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable. ☆171 Updated 2 years ago
- ☆150 Updated 11 months ago
- Getting crystal-like representations with harmonic loss ☆191 Updated 3 months ago
- Minimal GPT (~350 lines with a simple task to test it) ☆62 Updated 7 months ago
- ☆197 Updated 7 months ago
- 🧱 Modula software package ☆204 Updated 3 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆263 Updated 4 months ago
- Deep Learning, an Energy Approach ☆187 Updated last month
- ☆273 Updated last year
- Tools for working with the Abstraction & Reasoning Corpus ☆194 Updated 11 months ago
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025) ☆205 Updated 2 months ago
- Code for the book "The Elements of Differentiable Programming". ☆241 Updated 3 weeks ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally … ☆72 Updated 2 years ago
- Uncertainty quantification with PyTorch ☆362 Updated 3 months ago
- Implementation of Diffusion Transformer (DiT) in JAX ☆279 Updated last year
- ☆36 Updated 7 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆140 Updated last month
- Brain-like variational inference ☆55 Updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆91 Updated 4 months ago
- A package for defining deep learning models using categorical algebraic expressions. ☆61 Updated 11 months ago
- Compositional Linear Algebra ☆478 Updated last month
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.… ☆97 Updated 6 months ago
- Interactive textbook on state-space models ☆194 Updated last year
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients" ☆555 Updated last year
- ☆164 Updated 3 months ago
- Patched Attention for Nonlinear Dynamics ☆150 Updated 2 weeks ago
- ☆65 Updated 2 years ago
- Flow-matching algorithms in JAX ☆97 Updated 11 months ago
- Video Diffusion Model. Autoregressive, long context, efficient training and inference. WIP ☆28 Updated this week
- PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆179 Updated last month