|
Intel Hatch: Intel Server D50DNP1SB (Xeon Platinum |
SPEChpc 2021_sml_base = 15.5 |
|
SPEChpc 2021_sml_peak = 16.9 |
| hpc2021 License: | 13 | Test Date: | Apr-2025 |
|---|---|---|---|
| Test Sponsor: | Intel | Hardware Availability: | Jan-2023 |
| Tested by: | Intel | Software Availability: | Mar-2025 |
Benchmark result graphs are available in the PDF report.
| Benchmark | Base | Peak | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | Ranks | Thrds/Rnk | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Model | Ranks | Thrds/Rnk | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
| SPEChpc 2021_sml_base | 15.5 | |||||||||||||||||
| SPEChpc 2021_sml_peak | 16.9 | |||||||||||||||||
| Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||||||
| 605.lbm_s | TGT | 16 | 1 | 44.6 | 34.8 | 43.4 | 35.7 | 43.4 | 35.8 | TGT | 16 | 1 | 41.4 | 37.5 | 41.2 | 37.6 | 41.3 | 37.5 |
| 613.soma_s | TGT | 16 | 1 | 54.5 | 29.4 | 54.7 | 29.3 | 54.7 | 29.2 | TGT | 16 | 1 | 47.4 | 33.7 | 47.3 | 33.8 | 47.4 | 33.8 |
| 618.tealeaf_s | TGT | 16 | 1 | 223 | 9.20 | 223 | 9.19 | 226 | 9.09 | TGT | 16 | 1 | 214 | 9.56 | 211 | 9.70 | 213 | 9.64 |
| 619.clvleaf_s | TGT | 16 | 1 | 119 | 13.9 | 119 | 13.9 | 119 | 13.9 | TGT | 16 | 1 | 118 | 14.0 | 118 | 13.9 | 118 | 14.0 |
| 621.miniswp_s | TGT | 16 | 1 | 94.9 | 11.6 | 93.9 | 11.7 | 94.3 | 11.7 | TGT | 16 | 1 | 94.3 | 11.7 | 93.6 | 11.7 | 93.2 | 11.8 |
| 628.pot3d_s | TGT | 16 | 1 | 131 | 12.8 | 131 | 12.8 | 131 | 12.8 | TGT | 16 | 1 | 131 | 12.8 | 131 | 12.8 | 131 | 12.8 |
| 632.sph_exa_s | TGT | 16 | 1 | 211 | 10.9 | 210 | 10.9 | 210 | 11.0 | TGT | 16 | 1 | 200 | 11.5 | 200 | 11.5 | 199 | 11.6 |
| 634.hpgmgfv_s | TGT | 16 | 1 | 156 | 6.23 | 158 | 6.18 | 160 | 6.09 | TGT | 16 | 1 | 103 | 9.48 | 97.9 | 9.96 | 97.6 | 9.99 |
| 635.weather_s | TGT | 16 | 1 | 66.4 | 39.2 | 66.7 | 39.0 | 66.6 | 39.1 | TGT | 16 | 1 | 66.4 | 39.2 | 66.7 | 39.0 | 66.6 | 39.1 |
| Hardware Summary | |
|---|---|
| Type of System: | Homogenous Cluster |
| Compute Node: | Intel Server D50DNP1SB (Xeon Platinum 8480+) |
| Interconnect: | Mellanox HDR |
| Compute Nodes Used: | 2 |
| Total Chips: | 4 |
| Total Cores: | 224 |
| Total Threads: | 448 |
| Total Memory: | 2 TB |
| Total Accelerators: | 8 |
| Max. Peak Threads: | 1 |
| Software Summary | |
|---|---|
| Compiler: | Intel oneAPI Compiler 2025.1.0 |
| MPI Library: | Intel MPI Library 2021.15 for Linux OS |
| Other MPI Info: | None |
| Other Software: | None |
| Base Parallel Model: | TGT |
| Base Ranks Run: | 16 |
| Base Threads Run: | 1 |
| Peak Parallel Models: | TGT |
| Minimum Peak Ranks: | 16 |
| Maximum Peak Ranks: | 16 |
| Max. Peak Threads: | 1 |
| Min. Peak Threads: | 1 |
| Hardware | |
|---|---|
| Number of nodes: | 2 |
| Uses of the node: | Compute |
| Vendor: | Intel |
| Model: | Intel Server D50DNP1SB (2 x Intel Xeon Platinum 8480+, 2.0GHz) |
| CPU Name: | Intel Xeon Platinum 8480+ |
| CPU(s) orderable: | 1, 2 chips |
| Chips enabled: | 2 |
| Cores enabled: | 112 |
| Cores per chip: | 56 |
| Threads per core: | 2 |
| CPU Characteristics: | Turbo Boost Technology up to 3.8 GHz |
| CPU MHz: | 2000 |
| Primary Cache: | 32 KB I + 48 KB D on chip per core |
| Secondary Cache: | 2 MB I+D on chip per core |
| L3 Cache: | 105 MB I+D on chip per chip |
| Other Cache: | None |
| Memory: | 1 TB (16x64 GB DDR5 2Rx4 PC5-4800B-R) |
| Disk Subsystem: | 1 x 1 1TB NVMe M.2 INTEL SSDPELKX010T8 |
| Other Hardware: | None |
| Accel Count: | 4 |
| Accel Model: | Intel Data Center GPU Max 1550 |
| Accel Vendor: | Intel |
| Accel Type: | GPU |
| Accel Connection: | PCIe Gen5 x16 |
| Accel ECC enabled: | yes |
| Accel Description: | Intel Data Center GPU Max 1550 |
| Adapter: | Mellanox ConnectX-6 HDR |
| Number of Adapters: | 1 |
| Slot Type: | PCI-Express 4.0 x16 |
| Data Rate: | 200Gbit/s |
| Ports Used: | 1 |
| Interconnect Type: | Mellanox HDR |
| Software | |
|---|---|
| Accelerator Driver: | 25.05.32567 |
| Adapter: | Mellanox ConnectX-6 HDR |
| Adapter Firmware: | 20.38.1900 |
| Operating System: | SUSE Linux Enterprise Server 15 SP6 6.4.0-150600.23.42-default |
| Local File System: | lustre |
| Shared File System: | LUSTRE FS |
| System State: | Run level 5 |
| Other Software: | None |
| Hardware | |
|---|---|
| Vendor: | Mellanox |
| Model: | Mellanox HDR |
| Switch Model: | Mellanox Technologies MT28908 Family InfiniBand Switch |
| Number of Switches: | 12 |
| Number of Ports: | 40 |
| Data Rate: | 200 Gbit/s |
| Firmware: | 20.38.1900 |
| Topology: | Fat-tree |
| Primary Use: | MPI Traffic, LustreFS traffic |
| Software |
|---|
The config file option 'submit' was used.
Environment variables set by runhpc before the start of the run: LIBOMPTARGET_LEVEL_ZERO_USE_IMMEDIATE_COMMAND_LIST = "all" I_MPI_FABRICS=shm:ofi I_MPI_OFFLOAD=1 I_MPI_OFFLOAD_CELL=tile I_MPI_OFFLOAD_TOPOLIB=level_zero I_MPI_OFFLOAD_CELL_LIST=0,1,2,3,4,5,6,7 For the following tests src.alt was used in PEAK: 613 618 621 632 634
Device Vendor Intel Device Version OpenCL 3.0 NEO Driver Version 25.05.32567 Base clock 900MHz Max clock frequency 1600MHz Tiles 2 Slices per Tile 1 Max compute units per Tile 512 Sub-slices per slice 64 EUs per sub-slice 8 Threads per EU 8 Max work item dimensions 3 Max work item sizes 1024x1024x1024 Max work group size 1024 Preferred work group size multiple 32 Max sub-groups per work group 64 Sub-group sizes 16, 32 L1 Cache per EU 65536 L2 cache size 427819008 Global memory size 137438953472 Address bits 64, Little-Endian
| mpiicc -cc=icx |
| mpiicpc -cxx=icpx |
| mpiifort -fc=ifx |
| 605.lbm_s: | -DUSE_MPI |
| 613.soma_s: | -DUSE_MPI -DSPEC_NO_VAR_ARRAY_REDUCE |
| 618.tealeaf_s: | -DUSE_MPI |
| 619.clvleaf_s: | -DUSE_MPI |
| 628.pot3d_s: | -DUSE_MPI |
| 635.weather_s: | -DUSE_MPI |
| mpiicc -cc=icx |
| mpiicpc -cxx=icpx |
| mpiifort -fc=ifx |
| 605.lbm_s: | -DUSE_MPI |
| 613.soma_s: | -DUSE_MPI -DSPEC_NO_VAR_ARRAY_REDUCE |
| 618.tealeaf_s: | -DUSE_MPI |
| 619.clvleaf_s: | -DUSE_MPI |
| 628.pot3d_s: | -DUSE_MPI |
| 635.weather_s: | -DUSE_MPI |
| 619.clvleaf_s: | -DSPEC_COLLAPSE -O3 -xCORE-AVX512 -flto -mprefer-vector-width=512 -ffast-math -fiopenmp -fopenmp-targets=spir64_gen -ftarget-register-alloc-mode=pvc:auto -Xopenmp-target-backend '-device pvc -revision_id 0x2f' -DSPEC_ACCEL_AWARE_MPI -fopenmp-target-loopopt |
| 628.pot3d_s: | basepeak = yes |
| 635.weather_s: | basepeak = yes |