Skip to main content

Supported Cloud Instances

Learn about the instance types we support


We offer a range of instance types designed to handle a variety of machine learning workloads. These cloud instances vary in their CPU, RAM (Random Access Memory), and GPU configurations, which allow you to orchestrate the right balance of performance and cost for your use case.

Amazon Web Services (AWS) Instances

T3A Instances

The t3a series is designed for cost-effective, general-purpose workloads that do not require GPU acceleration. It offers a balanced combination of CPU and memory, making it ideal for lightweight applications.

Instance TypeGPUsTotal GPU RAMCPURAM
t3a.medium--2x CPU4GiB
t3a.large--2x CPU8GiB
t3a.xlarge--4x CPU16GiB
t3a.2xlarge--8x CPU32GiB

Key Features

  • vCPUs (virtual CPUs) — Burstable performance for intermittent, compute-heavy tasks. Ideal for CPU-intensive operations like running traditional models or pre-processing pipelines. For example, t3a.medium offers two vCPUs cores, while t3a.2xlarge offers eight vCPUs.

  • RAM — Determines the capacity for handling data in memory. It ranges from 4 GiB to 32 GiB, allowing you to handle lightweight, data-intensive workloads without requiring GPU acceleration.

Example Use Case

  • Running simple models for classification tasks.

G4DN Instances

The g4dn series is designed for moderate GPU-accelerated workloads, making it suitable for small-to-medium-scale machine learning tasks.

Instance TypeGPUsTotal GPU RAMCPURAM
g4dn.xlarge1x NVIDIA-T416GiB4x CPU16GiB

Key Features

  • NVIDIA T4 GPUs — Optimized for inference and light model training, offering a balance of performance and cost.

  • vCPUs and RAM — Includes four vCPUs and 16 GiB of RAM for data processing and workload orchestration.

Example Use Cases

  • Inference workloads, such as running NLP models like BERT-base for text summarization and question answering.

  • Fine-tuning pre-trained models for specific tasks like object detection or sentiment analysis.

G5 Instances

The g5 series delivers enhanced GPU capabilities and is designed for tasks requiring higher memory and computational power, such as large-scale deep learning model training.

Instance TypeGPUsTotal GPU RAMCPURAM
g5.xlarge1x NVIDIA-A10G24GiB4x CPU16GiB
g5.2xlarge1x NVIDIA-A10G24GiB8x CPU32GiB

Key Features

  • NVIDIA A10G GPUs — High memory bandwidth and compute power for complex deep learning models and advanced workloads.

  • vCPUs and RAM — Increased CPU and memory for tasks involving heavy data processing alongside GPU computation.

Example Use Cases

  • Training mid-sized NLP models like GPT-2 or T5 for text generation, or training image segmentation models like UNet or Mask R-CNN for medical imaging.

  • Running object tracking or pose estimation workflows in real-time video analysis.

G6 Instances

The g6 series offers next-generation GPU technologies and is designed for the most demanding machine learning workloads, including large-scale model training and high-performance simulations. Each instance type in the g6 series is tailored to specific workloads.

Instance TypeGPUsTotal GPU RAMCPURAM
g6.xlarge1x NVIDIA-L424GiB4x CPU16GiB
g6.2xlarge1x NVIDIA-L424GiB8x CPU32GiB
g6e.xlarge1x NVIDIA-L40S48GiB4x CPU32GiB
g6e.12xlarge4x NVIDIA-L40S192GiB48x CPU384GiB

Key Features

  • Next-Gen GPUs — NVIDIA L4 and L40S GPUs deliver exceptional performance for training and inference tasks, with GPU memory scaling from 24 GiB to 192 GiB.

  • High vCPU & RAM Configurations — Ideal for handling massive datasets and parallel processing for complex workflows.

Example Use Cases

  • The g6.xlarge and g6.2xlarge instances support mid-tier workloads, such as fine-tuning the BERT-large model or running computer vision tasks like text-to-image generation.

  • The g6e.xlarge and g6e.12xlarge instances support high-end workloads, such as training large-scale language models like GPT-4 or T5-XL for multi-modal tasks.

Google Cloud Platform (GCP) Instances

N2-Standard Instances

The n2-standard series is designed for cost-effective, general-purpose workloads that do not require GPU acceleration. These instances provide a balanced combination of CPU and memory, making them ideal for lightweight applications.

Instance TypeGPUsTotal GPU RAMCPURAM
n2-standard-2--2x CPU8 GiB
n2-standard-4--4x CPU16 GiB
n2-standard-8--8x CPU32 GiB
n2-standard-16--16x CPU64 GiB

Key Features

  • vCPUs (Virtual CPUs) — Optimized for CPU-intensive operations like running traditional models or pre-processing pipelines.
  • RAM — Ranges from 8 GiB to 64 GiB, allowing efficient handling of lightweight, data-intensive workloads.

Example Use Case

  • Running small-scale machine learning models or serving simple inference workloads.

G2-Standard Instances

The g2-standard series is designed for moderate GPU-accelerated workloads, making it ideal for small-to-medium-scale machine learning tasks.

Instance TypeGPUsTotal GPU RAMCPURAM
g2-standard-41x NVIDIA-L424 GiB4x CPU16 GiB
g2-standard-81x NVIDIA-L424 GiB8x CPU32 GiB
g2-standard-121x NVIDIA-L424 GiB12x CPU48 GiB
g2-standard-161x NVIDIA-L424 GiB16x CPU64 GiB
g2-standard-321x NVIDIA-L424 GiB32x CPU128 GiB

Key Features

  • NVIDIA L4 GPUs — Optimized for inference and light model training, offering a balance of performance and cost.
  • Scalable vCPUs and RAM — Supports larger data processing and orchestration workloads.

Example Use Cases

  • Running NLP models like BERT-base for text summarization.
  • Fine-tuning small vision models for object detection.

A2 & A3 High-Performance Instances

For large-scale deep learning and AI workloads, the a2 and a3 series provide cutting-edge GPUs with high memory bandwidth.

Instance TypeGPUsTotal GPU RAMCPURAM
a2-ultragpu-1g1x NVIDIA-A10080 GiB12x CPU170 GiB
a3-highgpu-1g1x NVIDIA-H10080 GiB26x CPU234 GiB

Key Features

  • NVIDIA A100 & H100 GPUs — Designed for high-end AI and deep learning tasks, including large-scale model training.
  • High CPU & RAM Configurations — Enables parallel processing for massive datasets and complex workflows.

Example Use Cases

  • Training large language models like GPT-4 or T5-XL.
  • Running real-time AI applications, such as video analytics or autonomous systems.

Vultr Cloud Servers Instances

VC2 Instances

The vc2 series provides a range of general-purpose instances suitable for tasks that do not require GPU acceleration. These instances offer a balance of CPU and RAM for lightweight applications and traditional computational workloads.

Instance TypeGPUsTotal GPU RAMCPURAM
vc2-2c-4gb--2x CPU4GiB
vc2-4c-8gb --4x CPU8GiB
vc2-6c-16gb--6x CPU16GiB
vc2-8c-32gb--8x CPU32GiB
vc2-16c-64gb--16x CPU64GiB
vc2-24c-96gb--24x CPU96GiB

Key Features

  • They offer a range of scalable CPU and RAM configurations, from 2 CPU cores and 4GiB RAM in the vc2-2c-4gb instance to 24 CPU cores and 96GiB RAM in the vc2-24c-96gb instance.

Example Use Cases

  • Suitable for lightweight applications and development environments.

VCG Instances

The vcg series includes instances equipped with NVIDIA GPUs, designed for workloads that benefit from GPU acceleration, such as deep learning and other GPU-intensive applications.

Instance TypeGPUsTotal GPU RAMCPURAM
vcg-a16-6c-64g-16vram1x NVIDIA-A1616GiB6x CPU64GiB
vcg-a100-12c-120g-80vram1x NVIDIA-A10080GiB12x CPU120GiB
vcg-l40s-16c-180g-48vram1x NVIDIA-L40S48GiB16x CPU180GiB

Key Features

  • NVIDIA GPUs — Provides powerful GPU acceleration for demanding workloads.
  • High vCPU and RAM Configurations — Supports complex tasks and large datasets.

Example Use Cases

  • High-performance training and inference for large-scale deep learning models.
  • Running AI inference workloads with optimized GPU acceleration.

Oracle Distributed Cloud Instances

VM.Standard.E2.1.Micro

The VM.Standard.E2.1.Micro instance provides a cost-effective option for general-purpose lightweight workloads.

Instance TypeGPUsTotal GPU RAMCPURAM
VM.Standard.E2.1.Micro--1x CPU1GiB

Key Features

  • Offers a single CPU core and 1GB of RAM.

Example Use Cases

  • Suitable for basic tasks and small-scale applications.

VM.GPU.A10.1

The VM.GPU.A10.1 instance is designed for GPU-intensive workloads.

Instance TypeGPUsTotal GPU RAMCPURAM
VM.GPU.A10.11x NVIDIA-A10G24GiB15x CPU240GiB

Key Features

  • NVIDIA A10G GPU — Provides high-performance GPU acceleration for demanding tasks.
  • High vCPU and RAM configuration — Supports complex computations and large datasets.

Example Use Cases

  • High-performance computing and rendering workloads.