Preview live today, available globally soon
We are announcing that a preview is now live for Azure HBv3 virtual machines enhanced by 3rd Gen AMD EPYC™ processors with AMD 3D V-cache, codenamed “Milan-X”. These processors significantly improve the performance, scaling efficiency, and cost-effectiveness of a variety of memory performance-bound workloads such as CFD, explicit finite element analysis, computational geoscience, weather simulation, and silicon design RTL workflows.
Compared to the current HBv3-series with 3rd Gen AMD EPYC processors, already the highest performance VM for HPC workloads on the public cloud, customers will experience up to:
◉ 80 percent higher performance for CFD
◉ 60 percent higher performance for EDA RTL
◉ 50 percent higher performance for explicit FEA
◉ 19 percent higher performance for weather simulation
In addition, all HBv3-series VMs globally will soon be upgraded with Milan-X processors. This upgrade will be provided at no additional cost beyond existing pricing for HBv3-series VMs, and with no changes required of customer workloads. No other changes are being made to the HBv3-series VM sizes customers already know and rely on for their critical research and business workloads.
Turbocharging memory performance-bound HPC workloads
Many HPC workloads are driven foremost by memory performance. For some, such as computational fluid dynamics, performance is directly driven by memory bandwidth. For others, such as RTL simulation that is the workhorse application for silicon design firms, this means memory latency. The forthcoming upgrade to HBv3-series VMs will address both needs by growing L3 cache memory to an unprecedented 1.5 gigabytes per virtual machine. This is three times larger than what is found in the standard 3rd Gen EPYC processors currently in HBv3-series VMs, and more than 25 times larger than the total L3 cache found in most HPC servers in customer datacenters today.
For memory bandwidth-bound workloads to run at an appropriate scale, the net effect of the larger L3 cache is an up to 1.8x increase in effective memory bandwidth. This means an HBv3 VM that today offers 350 GB/s (as measured by STREAM-TRIAD) will soon perform more like a VM with greater than 600 GB/s of memory bandwidth.
For memory latency-bound workloads, the net effect of the larger L3 cache in Milan-X processors is an up to 50 percent increase in cache hit rate and an overall cache latency range (latency from one core to nearest and farthest neighbor cores) that is, conversely, 50 percent lower than standard 3rd Gen EPYC processors.
Below is a sample of at-scale performance results with Azure HBv3 VMs enhanced with Milan-X processors as compared to the existing HBv3 series with standard 3rd Gen EPYC processors. Tests were conducted across a range of MPI scale scenarios, from 2 to 64 VMs (240 to 7,680 CPU cores).
0 comments:
Post a Comment