Node problem detection and recovery for AWS Neuron nodes within Amazon EKS clusters

Scale and simplify ML workload monitoring on Amazon EKS with AWS Neuron Monitor container

Accelerate deep learning training and simplify orchestration with AWS Trainium and AWS Batch

Get started quickly with AWS Trainium and AWS Inferentia using AWS Neuron DLAMI and AWS Neuron DLC

End-to-end LLM training on instance clusters with over 100 nodes using AWS Trainium