HeegyuKim / torch-xla-SPMDLinks
Pytorch/XLA SPMD Test code in Google TPU 
☆23Updated last year
Alternatives and similar repositories for torch-xla-SPMD
Users that are interested in torch-xla-SPMD are comparing it to the libraries listed below
Sorting:
- ☆121Updated last year
 - some common Huggingface transformers in maximal update parametrization (µP)☆86Updated 3 years ago
 - Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Updated last year
 - A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆69Updated last year
 - Understand and test language model architectures on synthetic tasks.☆234Updated last month
 - Multipack distributed sampler for fast padding-free training of LLMs☆201Updated last year
 - Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆85Updated last year