daekeun-ml / sm-distributed-training-step-by-step

This repository provides hands-on labs on PyTorch-based Distributed Training and SageMaker Distributed Training. It is written to make it easy for beginners to get started, and guides you through step-by-step modifications to the code based on the most basic BERT use cases.
13Updated last year

Related projects

Alternatives and complementary repositories for sm-distributed-training-step-by-step