Sohl-Dickstein / fractal
The boundary of neural network trainability is fractal
☆198Updated last year
Alternatives and similar repositories for fractal
Users that are interested in fractal are comparing it to the libraries listed below
Sorting:
- ☆150Updated 9 months ago
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆170Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆82Updated 2 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆275Updated 11 months ago
- Minimal GPT (~350 lines with a simple task to test it)☆62Updated 5 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆233Updated 2 months ago
- Video Diffusion Model. Autoregressive, long context, efficient training and inference☆28Updated this week
- Tools for working with the Abstraction & Reasoning Corpus☆187Updated 9 months ago
- 🧱 Modula software package☆189Updated last month
- Getting crystal-like representations with harmonic loss☆183Updated last month
- ☆180Updated 5 months ago
- For optimization algorithm research and development.☆513Updated this week
- Hierarchical Associative Memory User Experience☆101Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆117Updated 11 months ago
- ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).☆238Updated last week
- ☆217Updated 10 months ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆71Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆64Updated 9 months ago
- Flow-matching algorithms in JAX☆90Updated 9 months ago
- ☆431Updated 6 months ago
- Resources from the EleutherAI Math Reading Group☆53Updated 2 months ago
- Neural Networks and the Chomsky Hierarchy☆206Updated last year
- The history files when recording human interaction while solving ARC tasks☆109Updated 2 weeks ago
- ☆36Updated 5 months ago
- Domain Specific Language for the Abstraction and Reasoning Corpus☆254Updated 7 months ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆174Updated this week
- A simple implimentation of Bayesian Flow Networks (BFN)☆240Updated last year
- A package for defining deep learning models using categorical algebraic expressions.☆60Updated 9 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆183Updated 8 months ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆553Updated 10 months ago