Sohl-Dickstein / fractalLinks
The boundary of neural network trainability is fractal
☆208Updated last year
Alternatives and similar repositories for fractal
Users that are interested in fractal are comparing it to the libraries listed below
Sorting:
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆171Updated 2 years ago
- ☆150Updated 10 months ago
- Minimal GPT (~350 lines with a simple task to test it)☆62Updated 6 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆87Updated 3 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated 10 months ago
- ☆36Updated 6 months ago
- 🧱 Modula software package☆200Updated 2 months ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆72Updated 2 years ago
- ☆270Updated 11 months ago
- The history files when recording human interaction while solving ARC tasks☆112Updated 2 weeks ago
- Implementation of Diffusion Transformer (DiT) in JAX☆278Updated last year
- Tools for working with the Abstraction & Reasoning Corpus☆191Updated 10 months ago
- Resources from the EleutherAI Math Reading Group☆53Updated 3 months ago
- σ-GPT: A New Approach to Autoregressive Models☆65Updated 10 months ago
- Uncertainty quantification with PyTorch☆361Updated 2 months ago
- Deep Learning, an Energy Approach☆139Updated 2 weeks ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆251Updated 3 months ago
- Getting crystal-like representations with harmonic loss☆190Updated 2 months ago
- ☆190Updated 6 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆183Updated 9 months ago
- Flow-matching algorithms in JAX☆97Updated 10 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆140Updated last month
- ☆435Updated 8 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆91Updated last week
- Patched Attention for Nonlinear Dynamics☆148Updated 3 weeks ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆177Updated 2 weeks ago
- Efficient optimizers☆220Updated this week
- Compositional Linear Algebra☆477Updated last month
- My writings about ARC (Abstraction and Reasoning Corpus)☆77Updated this week
- Neural Networks and the Chomsky Hierarchy☆205Updated last year