francoisfleuret / picogpt
Minimal GPT (~350 lines with a simple task to test it)
☆62 · Updated last month
Alternatives and similar repositories for picogpt:
Users interested in picogpt often compare it with the repositories listed below.
- ☆58 · Updated 2 years ago
- σ-GPT: A New Approach to Autoregressive Models · ☆61 · Updated 5 months ago
- ☆149 · Updated 5 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources · ☆119 · Updated 2 weeks ago
- Visualizations of the theory behind diffusion models. · ☆77 · Updated 9 months ago
- A package for defining deep learning models using categorical algebraic expressions. · ☆59 · Updated 6 months ago
- ☆27 · Updated 6 months ago
- ☆53 · Updated last year
- Flow-matching algorithms in JAX · ☆83 · Updated 5 months ago
- ☆40 · Updated 2 months ago
- Because we don't have enough time to read everything · ☆87 · Updated 4 months ago
- Graph neural networks in JAX. · ☆67 · Updated 7 months ago
- Diffusion models in PyTorch · ☆89 · Updated 3 months ago
- The boundary of neural network trainability is fractal · ☆194 · Updated 11 months ago
- Because we don't want a jupyter notebook mess... · ☆59 · Updated last month
- Jax like function transformation engine but micro, microjax · ☆30 · Updated 3 months ago
- ☆150 · Updated last month
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally … · ☆69 · Updated last year
- Latent Diffusion Language Models · ☆68 · Updated last year
- Cellular Automata Accelerated in JAX · ☆79 · Updated 2 months ago
- ☆204 · Updated 6 months ago
- ☆78 · Updated 9 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" · ☆95 · Updated last month
- ☆25 · Updated last year
- Scripts to prep PC for development use after OS installs · ☆37 · Updated last week
- ☆14 · Updated last month
- Simple Transformer in Jax · ☆130 · Updated 7 months ago
- Run PyTorch in JAX. 🤝 · ☆216 · Updated 3 weeks ago
- ☆20 · Updated 9 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch · ☆22 · Updated last week