Accelerate your generative AI distributed training workloads with the NVIDIA NeMo Framework on Amazon EKS

End-to-end LLM training on instance clusters with over 100 nodes using AWS Trainium

Amazon SageMaker model parallel library now accelerates PyTorch FSDP workloads by up to 20%

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium