
Instance Types

Learn about the instance types we support


We offer a range of Amazon Web Services (AWS) instance types, designed to handle a variety of machine learning workloads. These instances vary in their CPU, RAM (Random Access Memory), and GPU configurations, so you can choose the right balance of performance and cost for your use case.

Info: You can contact us to unlock access to our most powerful instances, including those with NVIDIA A100 and H100 GPUs.

t3a Instances

The t3a series is designed for cost-effective, general-purpose workloads that do not require GPU acceleration. It offers a balanced combination of CPU and memory, making it ideal for lightweight applications.

Instance Type   GPUs   Total GPU RAM   CPU      RAM
t3a.medium      -      -               2x CPU   4 GiB
t3a.large       -      -               2x CPU   8 GiB
t3a.xlarge      -      -               4x CPU   16 GiB
t3a.2xlarge     -      -               8x CPU   32 GiB

Key Features

  • vCPUs (virtual CPUs) — Burstable performance, best suited to tasks whose CPU demand is intermittent rather than sustained, such as running traditional models or pre-processing pipelines. For example, t3a.medium offers two vCPUs, while t3a.2xlarge offers eight vCPUs.

  • RAM — Determines how much data can be held in memory. It ranges from 4 GiB to 32 GiB, which is enough for lightweight workloads on modest datasets that do not require GPU acceleration.

Example Use Case

  • Running simple models for classification tasks.

g4dn Instances

The g4dn series is designed for moderate GPU-accelerated workloads, making it suitable for small-to-medium-scale machine learning tasks.

Instance Type   GPUs    Total GPU RAM   CPU      RAM
g4dn.xlarge     1x T4   16 GiB          4x CPU   16 GiB

Key Features

  • NVIDIA T4 GPUs — Optimized for inference and light model training, offering a balance of performance and cost.

  • vCPUs and RAM — Includes four vCPUs and 16 GiB of RAM for data processing and workload orchestration.

Example Use Cases

  • Inference workloads, such as running NLP models like BERT-base for text summarization and question answering.

  • Fine-tuning pre-trained models for specific tasks like object detection or sentiment analysis.

g5 Instances

The g5 series delivers enhanced GPU capabilities and is designed for tasks requiring higher memory and computational power, such as large-scale deep learning model training.

Instance Type   GPUs      Total GPU RAM   CPU      RAM
g5.xlarge       1x A10G   24 GiB          4x CPU   16 GiB
g5.2xlarge      1x A10G   24 GiB          8x CPU   32 GiB

Key Features

  • NVIDIA A10G GPUs — High memory bandwidth and compute power for complex deep learning models and advanced workloads.

  • vCPUs and RAM — Increased CPU and memory for tasks involving heavy data processing alongside GPU computation.

Example Use Cases

  • Training mid-sized NLP models like GPT-2 or T5 for text generation, or training image segmentation models like UNet or Mask R-CNN for medical imaging.

  • Running object tracking or pose estimation workflows in real-time video analysis.

g6 Instances

The g6 series, which here includes the g6e variants, offers next-generation GPU technologies and is designed for the most demanding machine learning workloads, including large-scale model training and high-performance simulations. Each instance type in the series is tailored to specific workloads.

Instance Type   GPUs      Total GPU RAM   CPU       RAM
g6.xlarge       1x L4     24 GiB          4x CPU    16 GiB
g6.2xlarge      1x L4     24 GiB          8x CPU    32 GiB
g6e.xlarge      1x L40S   48 GiB          4x CPU    32 GiB
g6e.12xlarge    4x L40S   192 GiB         48x CPU   384 GiB

Key Features

  • Next-Gen GPUs — NVIDIA L4 and L40S GPUs deliver exceptional performance for training and inference tasks, with GPU memory scaling from 24 GiB to 192 GiB.

  • High vCPU & RAM Configurations — Ideal for handling massive datasets and parallel processing for complex workflows.
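
The "Total GPU RAM" column is the per-GPU memory multiplied by the number of GPUs. The following is a minimal sketch of that arithmetic; the per-GPU figures are derived from the tables on this page, and the dictionary and function names are hypothetical, not part of any API.

```python
# Per-GPU memory in GiB, derived from the tables on this page
# (total GPU RAM divided by the GPU count of each instance type).
GPU_MEMORY_GIB = {"T4": 16, "A10G": 24, "L4": 24, "L40S": 48}


def total_gpu_memory_gib(gpu_model: str, gpu_count: int) -> int:
    """Return the combined GPU memory of an instance in GiB."""
    return GPU_MEMORY_GIB[gpu_model] * gpu_count


print(total_gpu_memory_gib("L4", 1))    # g6.xlarge    -> 24
print(total_gpu_memory_gib("L40S", 4))  # g6e.12xlarge -> 192
```

Note that on a multi-GPU instance such as g6e.12xlarge, the total is spread across four separate devices, so a single model only sees the full 192 GiB if the workload is distributed across the GPUs.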

Example Use Cases

  • The g6.xlarge and g6.2xlarge instances support mid-tier workloads, such as fine-tuning the BERT-large model or running generative tasks like text-to-image generation.

  • The g6e.xlarge and g6e.12xlarge instances support high-end workloads, such as training or fine-tuning large language models like T5-XL.
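
To compare the instance types on this page against a workload's requirements, the tables can be encoded as data and filtered. The sketch below is illustrative only: the `InstanceType` class, `CATALOG` list, and `smallest_fit` function are hypothetical names, and because this page lists no prices, "smallest" is approximated by ordering on GPU RAM, then vCPUs, then RAM.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class InstanceType:
    name: str
    gpu_count: int
    gpu_ram_gib: int  # total GPU RAM across all GPUs
    vcpus: int
    ram_gib: int


# Values copied from the tables on this page.
CATALOG = [
    InstanceType("t3a.medium", 0, 0, 2, 4),
    InstanceType("t3a.large", 0, 0, 2, 8),
    InstanceType("t3a.xlarge", 0, 0, 4, 16),
    InstanceType("t3a.2xlarge", 0, 0, 8, 32),
    InstanceType("g4dn.xlarge", 1, 16, 4, 16),
    InstanceType("g5.xlarge", 1, 24, 4, 16),
    InstanceType("g5.2xlarge", 1, 24, 8, 32),
    InstanceType("g6.xlarge", 1, 24, 4, 16),
    InstanceType("g6.2xlarge", 1, 24, 8, 32),
    InstanceType("g6e.xlarge", 1, 48, 4, 32),
    InstanceType("g6e.12xlarge", 4, 192, 48, 384),
]


def smallest_fit(
    min_gpu_ram_gib: int = 0, min_vcpus: int = 0, min_ram_gib: int = 0
) -> Optional[InstanceType]:
    """Return the smallest instance meeting all minimums, or None.

    Size is an assumed proxy for cost (GPU RAM, then vCPUs, then RAM).
    Ties, such as g5.xlarge and g6.xlarge, resolve to the first entry
    in CATALOG.
    """
    candidates = [
        i
        for i in CATALOG
        if i.gpu_ram_gib >= min_gpu_ram_gib
        and i.vcpus >= min_vcpus
        and i.ram_gib >= min_ram_gib
    ]
    return min(
        candidates,
        key=lambda i: (i.gpu_ram_gib, i.vcpus, i.ram_gib),
        default=None,
    )


print(smallest_fit(min_ram_gib=8).name)       # t3a.large
print(smallest_fit(min_gpu_ram_gib=40).name)  # g6e.xlarge
```

Because instances with identical specifications use different GPU models (for example, A10G on g5 and L4 on g6), a real selection would also weigh GPU model and actual pricing, neither of which this sketch considers.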