xrsrke / pipegoose

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆86 · Dec 14, 2023 · Updated 2 years ago

Alternatives and similar repositories for pipegoose

Users who are interested in pipegoose compare it to the libraries listed below.
