Skip to content

AWS Deep Learning Containers for vLLM 0.19.1+amzn2023.6ef1efd5 on SageMaker

AWS Deep Learning Containers for SageMaker are now available with vLLM 0.19.1+amzn2023.6ef1efd5.

Announcements

  • Initial release of vLLM Server containers on Amazon Linux 2023 for SageMaker

  • Simplified tag format: server-sagemaker-cuda[-vMAJOR[.MINOR[.PATCH]]]

  • Built on Amazon Linux 2023 with Python 3.12 and CUDA 12.9

Core Packages

Package Version
vLLM 0.19.1+amzn2023.6ef1efd5
PyTorch 2.10.0
TorchVision 0.25.0
TorchAudio 2.10.0
Transformers 5.5.4
CUDA 12.9.1
NCCL 2.27.5
FlashInfer 0.6.7
EFA 1.47.0

Security Advisory

AWS recommends that customers monitor critical security updates in the AWS Security Bulletin.

Reference

Docker Image URIs

763104351884.dkr.ecr.us-west-2.amazonaws.com/vllm:server-sagemaker-cuda-v1

public.ecr.aws/deep-learning-containers/vllm:server-sagemaker-cuda-v1