francoisfleuret / picogpt
Minimal GPT (~350 lines with a simple task to test it)
☆62Updated 2 months ago
Alternatives and similar repositories for picogpt:
Users that are interested in picogpt are comparing it to the libraries listed below
- ☆53Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆60Updated 6 months ago
- ☆149Updated 6 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆59Updated 7 months ago
- The boundary of neural network trainability is fractal☆195Updated last year
- ☆33Updated last month
- Flow-matching algorithms in JAX☆85Updated 6 months ago
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025).☆81Updated last week
- Diffusion models in PyTorch☆92Updated last week
- ☆40Updated 3 months ago
- ☆59Updated 2 years ago
- ☆15Updated 2 months ago
- ☆36Updated 2 months ago
- Visualizations of the theory behind diffusion models.☆79Updated 10 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆84Updated last week
- Jax like function transformation engine but micro, microjax☆30Updated 4 months ago
- ☆25Updated last year
- Code for the book "The Elements of Differentiable Programming".☆73Updated 3 weeks ago
- Because we don't want a jupyter notebook mess...☆62Updated 2 months ago
- ☆31Updated 10 months ago
- ☆122Updated 2 weeks ago
- Run PyTorch in JAX. 🤝☆222Updated 2 weeks ago
- supporting pytorch FSDP for optimizers☆77Updated 2 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research☆87Updated 3 months ago
- ☆42Updated 2 months ago
- Graph neural networks in JAX.☆67Updated 8 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆120Updated 3 weeks ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆96Updated 2 months ago
- ☆57Updated 3 months ago