tczhangzhi / pytorch-parallel

Optimize an example model with Python, CPP, and CUDA extensions and Ring-Allreduce.
108Updated 5 years ago

Related projects: