soumyadip1995 / BabyGPTView on GitHub
Something in the middle of Karpathy's mingpt model and video lectures, BabyGPT is an easy to use model on a much smaller scale (16 and 256 out channels , 5 heads, fine tuned).
24Jan 13, 2026Updated last month

Alternatives and similar repositories for BabyGPT

Users that are interested in BabyGPT are comparing it to the libraries listed below

Sorting:

Are these results useful?