tysam-code / hlb-gpt

Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to larger models with one parameter change (feature currently in alpha).
345 · Updated 10 months ago

Alternatives and similar repositories for hlb-gpt

Users who are interested in hlb-gpt are comparing it to the libraries listed below.
