fattorib / Little-GPT

GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!
23Updated 2 years ago

Related projects

Alternatives and complementary repositories for Little-GPT