Cloud GPU
High-performance computing
The NVIDIA HGX H100 is a server platform designed specifically for artificial intelligence (AI) and high-performance computing (HPC) workloads.
Up to 30x faster performance
The NVIDIA HGX H100 features fourth-generation Tensor Cores and the Transformer Engine with FP8 precision, providing up to 30x faster performance over the prior generation for AI inference with 530B-parameter large language models.
8 x NVIDIA H100 SXM 80GB GPUs
8 x 3.84 TB NVMe
2 x 480 GB SSD
2 x Intel Xeon Platinum 8480+
112 cores / 224 threads @ 2 GHz
2048 GB Memory
15 TB Bandwidth
100 Gbps Network
Availability
Mini-Cluster 64 H100 GPUs
Next Availability December 15, 2023
Base-Cluster 248 H100 GPUs
Next Availability January 1, 2024
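For capacity planning, the cluster sizes above can be translated into node counts and aggregate GPU memory. The following is a minimal sketch, assuming every cluster node is the 8 x H100 80GB configuration listed in the specifications; the function name `cluster_summary` and the dictionary layout are illustrative, not part of any provider API.

```python
# Per-node figures taken from the spec list above.
GPUS_PER_NODE = 8
GPU_MEMORY_GB = 80

# Cluster sizes taken from the availability list.
CLUSTERS = {"Mini-Cluster": 64, "Base-Cluster": 248}


def cluster_summary(total_gpus: int) -> dict:
    """Return node count and aggregate GPU memory for a cluster.

    Assumption: the cluster is built only from 8-GPU nodes like the one above.
    """
    nodes, remainder = divmod(total_gpus, GPUS_PER_NODE)
    if remainder:
        raise ValueError("GPU count is not a multiple of GPUs per node")
    return {"nodes": nodes, "gpu_memory_gb": total_gpus * GPU_MEMORY_GB}


for name, gpus in CLUSTERS.items():
    print(name, cluster_summary(gpus))
```

Under that assumption, the Mini-Cluster corresponds to 8 nodes and the Base-Cluster to 31 nodes.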
Pricing
Key Features
A key benefit of the NVIDIA HGX H100 is its ability to handle large, complex models and datasets. This makes it well suited to industries such as healthcare, finance, and manufacturing, where data-intensive tasks are common.
Reduce compute costs with up to 30x higher inference throughput, and increase productivity with over 6x improvement on typical HPC workloads.