Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Use mobility data to derive insights using Amazon SageMaker geospatial capabilities

Host the Whisper Model on Amazon SageMaker: exploring inference options

Build financial search applications using the Amazon Bedrock Cohere multilingual embedding model

Ball position tracking in the cloud with the PGA TOUR

Build an Amazon SageMaker Model Registry approval and promotion workflow with human intervention

Inference Llama 2 models with real-time response streaming using Amazon SageMaker

Modernizing data science lifecycle management with AWS and Wipro

Amazon SageMaker model parallel library now accelerates PyTorch FSDP workloads by up to 20%