Skip to content

Deep Learning Containers

User Guide

aws/deep-learning-containers

Deep Learning Containers

aws/deep-learning-containers

Home
User Guide
User Guide
- User Guide
- vLLM
  vLLM
  - Overview
  - Supported Models
  - Deployment
    Deployment
    
    EC2
    
    EKS
    
    Amazon SageMaker AI
  - Configuration
  - Changelog
- vLLM-Omni
  vLLM-Omni
  - Overview
  - Supported Models
  - Deployment
    Deployment
    
    EC2
    
    Amazon SageMaker AI
  - Configuration
  - Changelog
- SGLang
  SGLang
  - Overview
  - Supported Models
  - Deployment
    Deployment
    
    EC2
    
    EKS
    
    Amazon SageMaker AI
  - Configuration
  - Changelog
- TEI
  TEI
  - Overview
  - Deployment
    Deployment
    
    Amazon SageMaker AI
- Ray
  Ray
  - Overview
  - Deployment
    Deployment
    
    EC2
    
    Amazon SageMaker AI
  - Changelog
- PyTorch
  PyTorch
  - Overview
  - Deployment
    Deployment
    
    EC2
    
    Amazon SageMaker AI
  - Changelog
- TensorFlow
  TensorFlow
  - Overview
  - Deployment
    Deployment
    
    Amazon SageMaker AI
  - Changelog
- Base
  Base
  - Overview
  - Changelog
Blog Posts
Blog Posts
- Tutorials
- Training
  Training
  - EKS Training
  - Distributed Fraud Detection (XGBoost)
- Inference
  Inference
- Integrations
  Integrations
  - MLflow
  - SOCI
Resources
Resources
- Resources
- Reference
  Reference
- Security
  Security

User Guide¶

Choose a framework to get started:

vLLM — serve large language models on EC2, EKS, or Amazon SageMaker AI
vLLM-Omni — serve multimodal models (TTS, image, video, audio, omni-chat)
TEI — serve text embedding, reranker, and classification models with Text Embeddings Inference on Amazon SageMaker AI
Ray — deploy any ML model with Ray Serve (NLP, vision, audio, tabular)
PyTorch — distributed training with EFA, NCCL, flash-attn, and DeepSpeed pre-installed
TensorFlow — training on Amazon SageMaker AI with EFA-capable multi-node support on Amazon Linux 2023
Base — lightweight CUDA + Python images for building your own AI/ML container