microsoft / deepspeed-gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
19Updated 2 years ago

Alternatives and similar repositories for deepspeed-gpt-neox:

Users that are interested in deepspeed-gpt-neox are comparing it to the libraries listed below