VatsaDev / NanoPoorLinks
NanoGPT-speedrunning for the poor T4 enjoyers
☆66Updated 2 months ago
Alternatives and similar repositories for NanoPoor
Users that are interested in NanoPoor are comparing it to the libraries listed below
Sorting:
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 3 months ago
- rl from zero pretrain, can it be done? we'll see.☆56Updated this week
- Collection of autoregressive model implementation☆85Updated 2 months ago
- supporting pytorch FSDP for optimizers☆82Updated 6 months ago
- ☆98Updated 5 months ago
- ☆49Updated last year
- NanoGPT (124M) quality in 2.67B tokens