Sohl-Dickstein / fractal
The boundary of neural network trainability is fractal
☆195Updated last year
Alternatives and similar repositories for fractal:
Users that are interested in fractal are comparing it to the libraries listed below
- ☆149Updated 7 months ago
- Visualizations of the theory behind diffusion models.☆148Updated 11 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆219Updated 2 weeks ago
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆165Updated last year
- ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).☆214Updated last week
- 🧱 Modula software package☆173Updated 2 weeks ago
- Minimal GPT (~350 lines with a simple task to test it)☆62Updated 3 months ago
- For optimization algorithm research and development.☆498Updated this week
- ☆36Updated 3 months ago
- ☆169Updated 3 months ago
- ☆420Updated 5 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆269Updated 9 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆60Updated 7 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆135Updated last week
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆72Updated last week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆186Updated 9 months ago
- Puzzles for exploring transformers☆333Updated last year
- WIP☆93Updated 7 months ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆169Updated 3 months ago
- Uncertainty quantification with PyTorch☆346Updated last week
- The history files when recording human interaction while solving ARC tasks☆97Updated this week
- A simple implimentation of Bayesian Flow Networks (BFN)☆240Updated last year
- ☆214Updated 8 months ago
- Resources from the EleutherAI Math Reading Group☆53Updated 3 weeks ago
- Flow-matching algorithms in JAX☆86Updated 7 months ago
- Tools for working with the Abstraction & Reasoning Corpus☆180Updated 7 months ago
- Interactive textbook on state-space models☆184Updated last year
- Machine Learning with Symbolic Tensors☆262Updated 2 weeks ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆67Updated last month