BrandPost: Performance Impact of Stacking Abaqus Jobs on 3rd Generation Intel Scalable Processors
Many clients today have models that run well on smaller core counts, or maybe license bound to a particular core count. Due to the large amount of CPUS available in a single system (56 or more), clients have expressed interest in “stacking” multiple smaller jobs on a single server to increase throughput.
Modern HPC system building blocks based on 3rd Generation Intel Xeon Scalable Processors have high memory bandwidth per core, and improved AVX512 and Turbo options that have made stacking less of a performance penalty than in the past.
To see the result of stacking on the latest 3rd Generation Intel Xeon Scalable Processors, we used the Abaqus e13 benchmark on an Intel 6346 based node that has 32 total cores per server. The baseline run was at 16 cores Exclusively (out of 32, which benefits from turbo effects), and then “stacked” two jobs on the same node, consuming all 32 cores (two 16-core jobs).
