Sohl-Dickstein / fractalLinks
The boundary of neural network trainability is fractal
☆217Updated last year
Alternatives and similar repositories for fractal
Users that are interested in fractal are comparing it to the libraries listed below
Sorting:
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆173Updated 2 years ago
- ☆150Updated last year
- Getting crystal-like representations with harmonic loss☆192Updated 7 months ago
- ☆37Updated this week
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆103Updated last month
- Compositional Linear Algebra☆491Updated 3 months ago
- Minimal GPT (~350 lines with a simple task to test it)☆63Updated 11 months ago
- Uncertainty quantification with PyTorch☆375Updated last month
- Implementation of Diffusion Transformer (DiT) in JAX☆294Updated last year
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆324Updated 4 months ago
- Tools for working with the Abstraction & Reasoning Corpus☆211Updated 2 months ago
- ☆285Updated last year
- 🧱 Modula software package☆303Updated 2 months ago
- Deep Learning, an Energy Approach☆220Updated 5 months ago
- ☆222Updated 11 months ago
- A projection-based framework for gradient-free and parallel learning☆106Updated 4 months ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆75Updated 2 years ago
- Diffusion models in PyTorch☆112Updated last week
- Code for the book "The Elements of Differentiable Programming".☆273Updated 4 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- Automatic gradient descent☆215Updated 2 years ago
- An interactive exploration of Transformer programming.☆271Updated 2 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆188Updated last month
- Brain-like variational inference☆57Updated 5 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆148Updated last month
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆564Updated last year
- ☆198Updated 3 months ago
- Run PyTorch in JAX. 🤝☆306Updated last month
- A simple implimentation of Bayesian Flow Networks (BFN)☆240Updated last year
- A 1D analogue of the MNIST dataset for measuring spatial biases and answering Science of Deep Learning questions.☆233Updated last year