GPU Architecture | NVIDIA Ampere |
GPU memory | 48 GB GDDR6 with ECC |
Memory bandwidth | 696 GB/s |
Interconnect interface | NVIDIA® NVLink® 112.5 GB/s (bidirectional)3 PCIe Gen4: 64GB/s |
NVIDIA Ampere architecture- based CUDA Cores | 10,752 |
NVIDIA second-generation RT Cores | 84 |
NVIDIA third-generation Tensor Cores | 336 |
Peak FP32 TFLOPS (non-Tensor) | 37.4 |
Peak FP16 Tensor TFLOPS with FP16 Accumulate | 149.7 | 299.4* |
Peak TF32 Tensor TFLOPS | 74.8 | 149.6* |
RT Core performance TFLOPS | 73.1 |
Peak BF16 Tensor TFLOPS with FP32 Accumulate | 149.7 | 299.4* |
Peak INT8 Tensor TOPS Peak INT 4 Tensor TOPS | 299.3 | 598.6* 598.7 | 1,197.4* |
Form factor | 4.4" (H) x 10.5" (L) dual slot |
Display ports | 3x DisplayPort 1.4**; Supports NVIDIA Mosaic and Quadro® Sync4 |
Max power consumption | 300w |
Power connector | 8-pin CPU |
Thermal solution | Passive |
vGPU profiles supported | See the Virtual GPU Licensing Guide |
Virtual GPU (vGPU) software support | NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation, NVIDIA Virtual Compute Server |