francoisfleuret / picogptLinks
Minimal GPT (~350 lines with a simple task to test it)
☆63Updated 2 weeks ago
Alternatives and similar repositories for picogpt
Users that are interested in picogpt are comparing it to the libraries listed below
Sorting:
- Jax like function transformation engine but micro, microjax☆33Updated last year
- The boundary of neural network trainability is fractal☆221Updated last year
- Getting crystal-like representations with harmonic loss☆192Updated 8 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- ☆211Updated last year
- A practical guide to diffusion models, implemented from scratch.☆164Updated this week
- ☆44Updated last month
- Because we don't want a jupyter notebook mess...☆61Updated 6 months ago
- ☆53Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch☆97Updated 4 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated 11 months ago
- Video Diffusion Model. Autoregressive, long context, efficient training and inference. WIP☆34Updated 3 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research☆105Updated last year
- Graph neural networks in JAX.☆68Updated last year
- ☆82Updated last year
- Fast singularity detection with kernel☆37Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- ☆28Updated last year
- ☆34Updated last year
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆85Updated 3 months ago
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆76Updated last month
- ☆58Updated 3 weeks ago
- A Python Library for Learning Non-Euclidean Representations☆67Updated 4 months ago
- lossily compress representation vectors using product quantization☆59Updated last month
- ☆56Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆32Updated 2 years ago
- An introduction to LLM Sampling☆79Updated 11 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 10 months ago
- Diffusion models in PyTorch☆116Updated last week
- ☆60Updated 3 years ago