francoisfleuret / picogpt
Minimal GPT (~350 lines with a simple task to test it)
☆62 Updated 3 months ago
Alternatives and similar repositories for picogpt:
Users who are interested in picogpt are comparing it to the libraries listed below.
- Visualizations of the theory behind diffusion models. ☆172 Updated 11 months ago
- Jax like function transformation engine but micro, microjax ☆30 Updated 5 months ago
- ☆87 Updated 3 weeks ago
- Graph neural networks in JAX. ☆67 Updated 9 months ago
- A package for defining deep learning models using categorical algebraic expressions. ☆60 Updated 8 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 Updated 2 months ago
- Flow-matching algorithms in JAX ☆86 Updated 7 months ago
- ☆27 Updated 8 months ago
- ☆150 Updated 7 months ago
- ☆60 Updated 3 years ago
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆84 Updated last week
- ☆25 Updated last year
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025) ☆84 Updated this week
- ☆53 Updated last year
- ☆42 Updated last week
- NanoGPT-speedrunning for the poor T4 enjoyers ☆49 Updated this week
- ☆55 Updated 4 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch ☆23 Updated 2 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆98 Updated 3 months ago
- Code for minimum-entropy coupling. ☆31 Updated 9 months ago
- ☆52 Updated 6 months ago
- Implementation of PSGD optimizer in JAX ☆30 Updated 3 months ago
- ☆33 Updated 6 months ago
- Induce brain-like topographic structure in your neural networks ☆53 Updated 3 weeks ago
- Because we don't want a jupyter notebook mess... ☆62 Updated 2 weeks ago
- Run PyTorch in JAX. 🤝 ☆232 Updated last month
- ☆44 Updated last month
- ☆41 Updated 3 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆135 Updated 3 weeks ago
- ☆36 Updated 3 months ago