Scale AI training and inference for drug discovery through Amazon EKS and Karpenter aws.amazon.com Post date April 19, 2024 External Tags AI/ML, Amazon EC2, Amazon EC2 Container Registry, Amazon Elastic Kubernetes Service, best-practices, Customer Solutions, generative-ai, Technical How-to
Large language model inference over confidential data using AWS Nitro Enclaves aws.amazon.com Post date March 12, 2024 External Tags Amazon EC2, AWS Key Management Service, Customer Solutions, Expert (400), Healthcare, Technical How-to
Introducing three new NVIDIA GPU-based Amazon EC2 instances aws.amazon.com Post date November 27, 2023 External Tags Amazon EC2, Announcements
Amazon EC2 DL2q instance for cost-efficient, high-performance AI inference is now generally available aws.amazon.com Post date November 22, 2023 External Tags Amazon EC2, artificial-intelligence, Customer Solutions
Enable pod-based GPU metrics in Amazon CloudWatch aws.amazon.com Post date September 7, 2023 External Tags Advanced (300), Amazon CloudWatch, Amazon EC2, Amazon Elastic Kubernetes Service
Maximize Stable Diffusion performance and lower inference costs with AWS Inferentia2 aws.amazon.com Post date July 26, 2023 External Tags Advanced (300), Amazon EC2, Amazon SageMaker, AWS Inferentia
Accelerate PyTorch with DeepSpeed to train large language models with Intel Habana Gaudi-based DL1 EC2 instances aws.amazon.com Post date June 7, 2023 External Tags Advanced (300), Amazon EC2