microsoft / deepspeed-gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
19Updated last year

Related projects

Alternatives and complementary repositories for deepspeed-gpt-neox