AWS Inferentia and AWS Trainium deliver lowest cost to deploy Llama 3 models in Amazon SageMaker JumpStart aws.amazon.com Post date May 2, 2024 External Tags Amazon SageMaker, Amazon SageMaker JumpStart, Announcements, artificial-intelligence, AWS Inferentia, AWS Trainium
Generative AI roadshow in North America with AWS and Hugging Face aws.amazon.com Post date April 2, 2024 External Tags Amazon SageMaker, AWS Inferentia, Events, generative-ai
Gradient makes LLM benchmarking cost-effective and effortless with AWS Inferentia aws.amazon.com Post date April 2, 2024 External Tags Advanced (300), AWS Inferentia, generative-ai
Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium aws.amazon.com Post date January 17, 2024 External Tags Advanced (300), Amazon SageMaker, AWS Inferentia, Intermediate (200)
Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2 aws.amazon.com Post date December 13, 2023 External Tags Amazon SageMaker, AWS Inferentia, generative-ai
Intuitivo achieves higher throughput while saving on AI/ML costs using AWS Inferentia and PyTorch aws.amazon.com Post date October 26, 2023 External Tags artificial-intelligence, AWS Inferentia, Customer Solutions
Maximize Stable Diffusion performance and lower inference costs with AWS Inferentia2 aws.amazon.com Post date July 26, 2023 External Tags Advanced (300), Amazon EC2, Amazon SageMaker, AWS Inferentia
Optimize AWS Inferentia utilization with FastAPI and PyTorch models on Amazon EC2 Inf1 & Inf2 instances aws.amazon.com Post date July 24, 2023 External Tags AWS Inferentia, pytorch, PyTorch on AWS
Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators aws.amazon.com Post date June 20, 2023 External Tags AWS Inferentia, AWS Trainium, generative-ai, Intermediate (200), sustainability, Thought Leadership
AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency aws.amazon.com Post date June 13, 2023 External Tags Advanced (300), artificial-intelligence, AWS Inferentia