tqfang / comet-deepspeed

Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.
14Updated 2 years ago

Related projects: