The NVIDIA HGX B300 provides the compute needed to handle the growing acceleration requirements of generative AI and HPC workloads. With NVIDIA Blackwell Ultra GPUs integrated with high-speed interconnect and a total of 2.1 TB of fast memory, NVIDIA HGX B300s offer 108 petaFLOPS of dense FP4 performance for increased AI throughput. Unleash the power of AI reasoning with up to 30x greater AI factory output compared to Hopper-generation GPUs.
Capacity available soon
Reserve capacity at Vultr now to access the latest in NVIDIA acceleration.
Key features
Accelerated by eight NVIDIA Blackwell Ultra GPUs with 2.1 TB aggregate GPU memory and 64 TB/s total memory bandwidth
Connected by NVIDIA 5th Generation NVLink™ 1.8 TB/s interconnect
Dedicated Blackwell Decompression Engine accelerates data queries for large datasets at up to 800 GB/s
AI inference, agentic AI, AI training, and high-performance computing (HPC) workloads require multiple GPUs with extremely fast interconnections and a fully accelerated software stack. The NVIDIA HGX™ AI supercomputing platform brings together the full power of NVIDIA GPUs, NVIDIA NVLink™, NVIDIA networking, and fully optimized AI and HPC software stacks to provide the highest application performance and drive the fastest time to value.
Vultr Clusters delivers on-demand NVIDIA HGX B300 clusters with high-speed GPU fabric. Provision and configure Vultr Cloud GPU or Vultr Bare Metal clusters as needed without special requests or required reservations. Easily add high-performance storage, monitor key cluster metrics, and schedule workloads with Slurm or Kubernetes. Deploying and managing clusters is a simple process through the Vultr Console or API, enabling seamless infrastructure management for varying workload needs.
Vultr's enterprise-ready infrastructure seamlessly supports any cluster size of NVIDIA Blackwell Ultra GPUs. Whether you require a small cluster or a massive deployment, Vultr ensures reliable, high-performance computing for your specific needs.
Large clusters of NVIDIA Blackwell Ultra GPUs are available where you need them, thanks to Vultr’s far-reaching infrastructure. With 33 global cloud data center regions across six continents, we guarantee low latency and high availability, enabling your enterprise to achieve optimal performance worldwide.
Vultr ensures our platform, products, and services meet diverse global compliance, privacy, and security needs, covering areas such as server availability, data protection, and privacy. Our commitment to industry-wide privacy and security frameworks, including ISO, HIPAA, and SOC 2 Type 2 standards, demonstrates our dedication to protecting our customers' data.
The NVIDIA HGX B300 runs agentic AI and HPC workloads efficiently and cost-effectively
The NVIDIA HGX B300 provides the compute power to handle the growing complexity and size of modern AI workloads. Leverage NVIDIA HGX B300s on Vultr to accelerate demanding applications affordably on a composable, global platform.
30x Greater*
2.6x Greater*
*As compared to NVIDIA Hopper GPUs
NVIDIA HGX B300
NVIDIA Blackwell Ultra GPUs: 8
Fast Memory: 2.1 TB
NVLink GPU-to-GPU Bandwidth: 1.8 TB/s
Aggregate NVLink Bandwidth: 14.4 TB/s
FP4 Tensor Core: 144 petaFLOPS¹
FP8/FP6 Tensor Core: 72 petaFLOPS¹
INT8 Tensor Core: 72 petaOPS¹
GPU Memory: 270 GB HBM3E per GPU
Decoders/GPU: 7 NVDEC | 7 NVJPEG
Interconnect: 5th Generation NVIDIA NVLink™
¹Specification with sparsity.
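The aggregate figures in the specifications above follow from the per-GPU values. The short Python sketch below reproduces that arithmetic. The function and constant names are illustrative only, and it assumes decimal units (1 TB = 1000 GB), which the page does not state.

```python
# Per-GPU figures taken from the specifications above.
GPU_COUNT = 8
NVLINK_TB_S_PER_GPU = 1.8   # NVLink GPU-to-GPU bandwidth, TB/s
HBM_GB_PER_GPU = 270        # HBM3E capacity per GPU, GB


def aggregate_nvlink_tb_s(gpus=GPU_COUNT, per_gpu=NVLINK_TB_S_PER_GPU):
    """Sum of per-GPU NVLink bandwidth across the system, in TB/s."""
    return round(gpus * per_gpu, 1)


def total_memory_tb(gpus=GPU_COUNT, per_gpu_gb=HBM_GB_PER_GPU):
    """Total GPU memory in TB, assuming 1 TB = 1000 GB."""
    return gpus * per_gpu_gb / 1000


if __name__ == "__main__":
    print(f"Aggregate NVLink bandwidth: {aggregate_nvlink_tb_s()} TB/s")
    print(f"Total GPU memory: {total_memory_tb()} TB")
```

Eight GPUs at 1.8 TB/s each give the listed 14.4 TB/s aggregate NVLink bandwidth, and eight GPUs at 270 GB each give 2.16 TB, in line with the 2.1 TB of fast memory quoted on this page.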
The NVIDIA HGX B300 includes eight NVIDIA Blackwell Ultra GPUs connected by high-speed interconnect. Together they deliver 2.1 TB of fast memory and 108 petaFLOPS of dense FP4 performance, with AI factory output up to 30x greater than Hopper-generation systems.
The NVIDIA HGX B300 is designed for generative AI and HPC workloads. These workloads require fast interconnections and an accelerated software stack, both of which are provided by the NVIDIA HGX AI supercomputing platform (comprising NVIDIA GPUs, NVIDIA NVLink™, NVIDIA networking, and more).
Increasingly complex AI workloads rely on multi-step reasoning, which needs large context windows and up to 100x more compute than single-pass inference queries. NVIDIA HGX B300s provide the acceleration necessary to support these context windows with lower energy consumption and cost per token.
Vultr provides leading price-to-performance across 33 global cloud data center regions, enabling efficient, flexible, cost-effective, scalable deployments worldwide with minimal management overhead and enterprise-grade security and compliance.
Vultr GPU Enabled Images contain NVIDIA software such as the NVIDIA CUDA toolkit, simplifying setup and deployment even further on Vultr’s easy-to-use control panel or API.
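As an illustration of a first check after deployment, the sketch below lists the GPUs visible to an instance by calling `nvidia-smi`, the command-line tool that ships with the NVIDIA driver. It assumes `nvidia-smi` is on the PATH and reports memory as an integer number of MiB. The function names are hypothetical and are not part of any Vultr or NVIDIA API.

```python
import shutil
import subprocess

# Ask nvidia-smi for one CSV line per GPU: "<name>, <total memory in MiB>".
QUERY = [
    "nvidia-smi",
    "--query-gpu=name,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_gpu_inventory(text):
    """Parse 'name, memory_mib' CSV lines into (name, memory_mib) tuples."""
    gpus = []
    for line in text.strip().splitlines():
        # Split on the last comma so a comma inside the name would not break parsing.
        name, mem = line.rsplit(",", 1)
        gpus.append((name.strip(), int(mem.strip())))
    return gpus


def local_gpu_inventory():
    """Return the local GPU inventory, or [] if nvidia-smi is missing or fails."""
    if shutil.which("nvidia-smi") is None:
        return []
    try:
        result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    except subprocess.CalledProcessError:
        return []
    return parse_gpu_inventory(result.stdout)


if __name__ == "__main__":
    inventory = local_gpu_inventory()
    print(f"GPUs visible: {len(inventory)}")
    for name, mem_mib in inventory:
        print(f"{name}: {mem_mib} MiB")
```

On an HGX B300 system the inventory would be expected to contain eight entries. The exact device names and memory values reported depend on the driver, so none are shown here.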