User Guide

Choose a framework to get started:

  • vLLM — serve large language models on EC2, EKS, or Amazon SageMaker AI
  • vLLM-Omni — serve multimodal models (TTS, image, video, audio, omni-chat)
  • Ray — deploy any ML model with Ray Serve (NLP, vision, audio, tabular)
  • PyTorch — distributed training with EFA, NCCL, flash-attn, and DeepSpeed pre-installed
  • Base — lightweight CUDA + Python images for building your own AI/ML container