NVIDIA A40

Bring your VFX visions to life faster and at a lower cost by using the NVIDIA A40 on demand with Vultr Cloud GPU. Provision A40s to meet diverse computing needs, with options available from fractions of a single NVIDIA A40 to complete A40 servers.

The NVIDIA A40 combines professional graphics with powerful compute and AI to meet today's design, creative, and scientific challenges.

Pricing

NVIDIA A40: starting at $0.075 / hour
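
As a rough illustration of the starting price above, the following Python sketch estimates a charge from an hourly rate. The function name `estimate_cost` and the 730-hour month are assumptions made for this example. Actual billing (rounding, minimum charges, and the rate for larger A40 configurations) may differ.

```python
# Assumption: an "average" month of 8760 / 12 = 730 hours.
HOURS_PER_MONTH = 730

# Starting hourly rate quoted on this page, in US dollars.
STARTING_RATE = 0.075


def estimate_cost(hourly_rate: float, hours: float) -> float:
    """Return an estimated charge in dollars, rounded to cents."""
    if hourly_rate < 0 or hours < 0:
        raise ValueError("hourly_rate and hours must be non-negative")
    return round(hourly_rate * hours, 2)


if __name__ == "__main__":
    print(f"8-hour job: ${estimate_cost(STARTING_RATE, 8):.2f}")
    print(f"Full month: ${estimate_cost(STARTING_RATE, HOURS_PER_MONTH):.2f}")
```

At the starting rate, an 8-hour job works out to $0.60 and a 730-hour month to $54.75 under these assumptions.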

Key features

Featuring NVIDIA Ampere architecture CUDA® cores, second-generation RT cores, and third-generation Tensor Cores, the NVIDIA A40 GPU combines best-in-class professional graphics with powerful acceleration for AI and compute workloads.

The Everywhere Cloud

The perfect virtual workstation

Working from home? Or maybe even the beach? Wherever you go, the computer you need for your most resource-intensive projects is always with you. Heavyweight applications such as Blender, Maya, OBS, and Cinema 4D run smoothly and quickly, allowing you to create 3D visualizations, animated films, and visual effects.

Virtualization meets visualization

The only cloud to virtualize the NVIDIA A40

Vultr has made accelerated computing affordable by virtualizing the industry-leading GPU for visual computing: the NVIDIA A40. Our unique approach partitions physical GPUs into discrete virtual GPUs, each with their own memory and compute. Perfect for ray-traced rendering, simulation, virtual production, and CAD.

Powering high-performance virtual workstations cost-effectively

The NVIDIA A40 GPUs and NVIDIA RTX Virtual Workstation (vWS) offer high-performance cloud computing solutions across a vast range of applications. Explore the power of cutting-edge NVIDIA GPU technology for high-performance workstations.

Low latency through global availability

Vultr offers a global cloud GPU platform, ensuring you experience low latency, with near-native performance for virtual workstations.

Chicago, Illinois, United States
Miami, Florida, United States
Amsterdam, Netherlands
New Jersey, United States
Dallas, Texas, United States
Paris, France
Mexico City, Mexico
São Paulo, Brazil
Madrid, Spain
Warsaw, Poland
Tokyo, Japan
Seattle, Washington, United States
Los Angeles, California, United States
Silicon Valley, California, United States
Singapore
Atlanta, Georgia, United States
London, United Kingdom
Frankfurt, Germany
Sydney, Australia
Melbourne, Australia
Toronto, Canada
Seoul, South Korea
Stockholm, Sweden
Honolulu, Hawaii, United States
Mumbai, India
Bangalore, India
Delhi NCR, India
Santiago, Chile
Tel Aviv-Yafo, Israel
Johannesburg, South Africa
Osaka, Japan
Manchester, United Kingdom
17 regions

Specifications

Our easy-to-use control panel and API let you spend more time building and less time managing your infrastructure.

GPU Memory

48 GB GDDR6 with error-correcting code (ECC)

GPU Memory Bandwidth

696 GB/s

Interconnect

NVIDIA NVLink: 112.5 GB/s (bidirectional); PCIe Gen4: 64 GB/s

NVLink

2-way low profile (2-slot)

Display Ports

3x DisplayPort 1.4

Max Power Consumption

300 W

Form Factor

4.4" (H) x 10.5" (L) Dual Slot

Thermal

Passive

vGPU Software Support

NVIDIA Virtual PC
NVIDIA Virtual Applications
NVIDIA RTX Virtual Workstation
NVIDIA Virtual Compute Server
NVIDIA AI Enterprise

vGPU Profiles Supported

See the Virtual GPU Licensing Guide

NVENC | NVDEC

1x | 2x (includes AV1 decode)

Secure and Measured Boot with Hardware Root of Trust

Yes (optional)

NEBS Ready

Level 3

Power Connector

8-pin CPU

Additional resources

Docs, demos, and information to help you succeed with your machine learning projects.

FAQ

What is the NVIDIA A40 GPU?

The NVIDIA A40 is a high-performance data center GPU designed for AI, deep learning, high-end visualization, and virtual workstation applications. It features 48GB of GDDR6 memory and third-generation Tensor Cores to accelerate workloads.

What are the key features of the NVIDIA A40?

  • 48GB GDDR6 Memory for handling large AI and deep learning datasets
  • 3rd Gen Tensor Cores for AI acceleration
  • 2nd Gen RT Cores for real-time ray tracing
  • PCIe Gen 4 Support for high-speed data transfer
  • NVIDIA vGPU software support for sharing one GPU across multiple virtual machines

What workloads are best suited for NVIDIA A40?

The NVIDIA A40 GPU is ideal for:

  • AI & Machine Learning – Deep learning training & inference
  • Rendering & Visualization – High-quality 3D rendering & design workflows
  • Data Science – Large-scale data processing & analytics
  • Virtualization – Multi-user GPU acceleration for cloud workstations
  • Video Processing – High-resolution video editing and transcoding
  • Virtual Desktops & Workstations – Resource-intensive projects

Can I run deep learning models on an NVIDIA A40 GPU?

Yes, the NVIDIA A40 is optimized for AI and deep learning tasks. It supports frameworks like TensorFlow, PyTorch, and Keras, making it an excellent choice for machine learning workloads.
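
As a minimal sketch of a first check before running a deep learning job, the Python snippet below asks PyTorch which GPU it can see. The helper name `describe_gpu` is hypothetical, and the snippet assumes PyTorch with CUDA support is installed on the instance. The device name and memory reported will depend on the A40 configuration you provisioned.

```python
def describe_gpu() -> str:
    """Return a one-line description of the first visible CUDA device."""
    try:
        import torch  # assumption: PyTorch is installed on the instance
    except ImportError:
        return "PyTorch is not installed"

    if not torch.cuda.is_available():
        return "No CUDA device is visible to PyTorch"

    # Query the properties of device 0 (the first GPU).
    props = torch.cuda.get_device_properties(0)
    memory_gib = props.total_memory / 1024**3
    return f"{props.name}: {memory_gib:.1f} GiB of GPU memory"


if __name__ == "__main__":
    print(describe_gpu())
```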

Can I use NVIDIA A40 for virtualization and cloud workstations?

Yes, NVIDIA A40 supports virtualization and is designed for cloud-based workstations, remote desktop applications, and virtualized AI workloads.

Does the NVIDIA A40 support ray tracing?

Yes, the NVIDIA A40 includes 2nd-generation RT Cores, which enable real-time ray tracing for rendering and visualization applications.

Can I run multiple workloads on a single NVIDIA A40 GPU?

Yes. With NVIDIA vGPU software, a single A40 can be partitioned into virtual GPUs so that multiple users or workloads share it. Note that the A40 does not support Multi-Instance GPU (MIG); that feature is specific to GPUs such as the A100.

How does NVIDIA A40 handle large-scale data workloads?

With 48GB of GDDR6 memory, the NVIDIA A40 is built for high-memory-demanding workloads like AI, simulation, and big data analytics.
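
To make the memory figure concrete, here is a back-of-the-envelope Python sketch that estimates whether a model's weights fit in 48 GB. The function names, the 1.2 overhead factor for activations and buffers, and the treatment of 48 GB as 48 × 10^9 bytes are all assumptions for illustration. Real memory use depends on the framework, batch size, and workload.

```python
# GPU memory from the specifications, treated here as decimal gigabytes.
A40_MEMORY_GB = 48.0


def required_gb(num_params: float, bytes_per_param: int,
                overhead: float = 1.2) -> float:
    """Estimate memory in GB: weight size times a hypothetical overhead factor."""
    return num_params * bytes_per_param * overhead / 1e9


def fits_on_a40(num_params: float, bytes_per_param: int,
                overhead: float = 1.2) -> bool:
    """Return True if the estimate fits within a full A40's memory."""
    return required_gb(num_params, bytes_per_param, overhead) <= A40_MEMORY_GB


if __name__ == "__main__":
    # Hypothetical model sizes, stored as FP16 (2 bytes per parameter).
    for params in (7e9, 30e9):
        need = required_gb(params, 2)
        print(f"{params / 1e9:.0f}B params: ~{need:.1f} GB, "
              f"fits: {fits_on_a40(params, 2)}")
```

Under these assumptions, a 7-billion-parameter model in FP16 needs roughly 16.8 GB and fits, while a 30-billion-parameter model needs roughly 72 GB and does not fit on a single A40.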

Can I use NVIDIA A40 in multi-GPU configurations?

Yes, A40 supports multi-GPU configurations, allowing for scalable HPC and AI model training.

How does the NVIDIA A40 compare to other data center GPUs for inference workloads?

The NVIDIA A40 is powerful for AI inference thanks to its third-generation Tensor Cores, which deliver high throughput for INT8 and FP16 operations. Compared to the A100, which excels in training, the A40 provides a more cost-effective solution for deploying inference at scale, especially in cloud-based environments that rely on virtualization through NVIDIA vGPU software.

What are the performance implications of using the NVIDIA A40 with PCIe Gen 4 in cloud deployments?

PCIe Gen 4 support on the NVIDIA A40 enables higher data throughput between the GPU and CPU, reducing bottlenecks in data-intensive workloads like real-time rendering and AI inference. When deployed in cloud environments like Vultr's, this translates to faster model loading times, improved rendering speeds, and lower latency for edge applications.
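
As a rough worked example of why link bandwidth matters for model loading, the Python sketch below computes an idealized transfer time from the figures in the specifications. It assumes the listed 64 GB/s PCIe Gen4 figure is bidirectional, so one direction is taken as 32 GB/s. The function name `transfer_seconds` and the `efficiency` parameter are hypothetical, and real transfers are slower than these theoretical peaks.

```python
def transfer_seconds(size_gb: float, bandwidth_gb_per_s: float,
                     efficiency: float = 1.0) -> float:
    """Idealized time to move size_gb at the given bandwidth."""
    if bandwidth_gb_per_s <= 0 or not 0 < efficiency <= 1:
        raise ValueError("bandwidth must be > 0 and efficiency in (0, 1]")
    return size_gb / (bandwidth_gb_per_s * efficiency)


# Assumption: half of the listed 64 GB/s bidirectional PCIe Gen4 figure.
PCIE_GEN4_ONE_WAY_GB_S = 32.0
# GPU memory bandwidth from the specifications.
GPU_MEMORY_GB_S = 696.0

if __name__ == "__main__":
    size_gb = 48.0  # enough data to fill the A40's memory
    host_to_gpu = transfer_seconds(size_gb, PCIE_GEN4_ONE_WAY_GB_S)
    on_gpu = transfer_seconds(size_gb, GPU_MEMORY_GB_S)
    print(f"Host to GPU over PCIe: {host_to_gpu:.2f} s")
    print(f"Read once from GPU memory: {on_gpu:.3f} s")
```

Filling all 48 GB over one direction of the link would take about 1.5 seconds at the theoretical peak, which is why the host-to-GPU link, and not GPU memory, tends to dominate load times.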

In what scenarios is GPU partitioning on the NVIDIA A40 most beneficial?

GPU partitioning is ideal for virtualized environments with multiple users or workloads sharing a single GPU. The A40 does this through NVIDIA vGPU software, not Multi-Instance GPU (MIG), which it does not support. Partitioning lets enterprises divide GPU resources across virtual machines, and it is well suited to hosting virtual workstations, AI model serving, and multi-tenant SaaS platforms that require GPU acceleration without overprovisioning.

How does ray tracing performance on the NVIDIA A40 impact cloud-based rendering pipelines?

The A40’s second-generation RT Cores provide significant ray tracing acceleration, making it a strong choice for VFX studios and design teams using cloud-based render farms. Vultr’s GPU cloud enables real-time visualization, interactive design reviews, and faster final frame rendering without needing on-prem infrastructure.

Get started with the world’s largest privately held cloud infrastructure company

Create an account