frankxu2004 / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
17Updated 2 years ago

Alternatives and similar repositories for gpt-neox:

Users that are interested in gpt-neox are comparing it to the libraries listed below