Sohl-Dickstein / fractalLinks
The boundary of neural network trainability is fractal
☆215Updated last year
Alternatives and similar repositories for fractal
Users that are interested in fractal are comparing it to the libraries listed below
Sorting:
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆172Updated 2 years ago
- ☆150Updated 11 months ago
- 🧱 Modula software package☆216Updated last week
- Getting crystal-like representations with harmonic loss☆192Updated 4 months ago
- Minimal GPT (~350 lines with a simple task to test it)☆62Updated 7 months ago
- ☆275Updated last year
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆72Updated 2 years ago
- For optimization algorithm research and development.☆525Updated this week
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆559Updated last year
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆274Updated 2 weeks ago
- Tools for working with the Abstraction & Reasoning Corpus☆196Updated 11 months ago
- Interactive textbook on state-space models☆196Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆286Updated last year
- ☆206Updated 8 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆184Updated 10 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆93Updated 4 months ago
- An interactive exploration of Transformer programming.☆267Updated last year
- ☆51Updated last year
- Exact method for visualizing partitions of a Deep Neural Network, CVPR 2023 Highlight☆109Updated 5 months ago
- Annotated version of the Mamba paper☆487Updated last year
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- Compositional Linear Algebra☆487Updated this week
- ☆38Updated 7 months ago
- A 1D analogue of the MNIST dataset for measuring spatial biases and answering Science of Deep Learning questions.☆227Updated 9 months ago
- Automatic gradient descent☆208Updated 2 years ago
- The history files when recording human interaction while solving ARC tasks☆114Updated last week
- Uncertainty quantification with PyTorch☆367Updated 3 months ago
- Resources from the EleutherAI Math Reading Group☆53Updated 5 months ago
- Deep Learning, an Energy Approach☆199Updated last month
- Implementation of https://srush.github.io/annotated-s4☆500Updated last month