xrsrke / pipegoose

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆81 · Updated last year

Alternatives and similar repositories for pipegoose:

Users who are interested in pipegoose are comparing it to the libraries listed below.