Maximize Stable Diffusion performance and lower inference costs with AWS Inferentia2 aws.amazon.com Post date July 26, 2023 No Comments on Maximize Stable Diffusion performance and lower inference costs with AWS Inferentia2 External Tags Advanced (300), Amazon EC2, Amazon SageMaker, AWS Inferentia
Optimize AWS Inferentia utilization with FastAPI and PyTorch models on Amazon EC2 Inf1 & Inf2 instances aws.amazon.com Post date July 24, 2023 No Comments on Optimize AWS Inferentia utilization with FastAPI and PyTorch models on Amazon EC2 Inf1 & Inf2 instances External Tags AWS Inferentia, pytorch, PyTorch on AWS
Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators aws.amazon.com Post date June 20, 2023 No Comments on Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators External Tags AWS Inferentia, AWS Trainium, generative-ai, Intermediate (200), sustainability, Thought Leadership
AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency aws.amazon.com Post date June 13, 2023 No Comments on AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency External Tags Advanced (300), artificial-intelligence, AWS Inferentia