Scaling distributed training with AWS Trainium and Amazon EKS
AWS Machine Learning
FEBRUARY 1, 2023
Many enterprise customers choose to deploy their deep learning workloads using Kubernetes—the de facto standard for container orchestration in the cloud. These images contain the Neuron SDK (excluding the Neuron driver, which runs directly on the Trn1 instances), PyTorch training script, and required dependencies.
Let's personalize your content