SPEC® CFP2006 Result

Copyright 2006-2017 Standard Performance Evaluation Corporation

Huawei

Huawei 1288H V5 (Intel Xeon Silver 4116)

CPU2006 license: 3175 Test date: Jul-2017
Test sponsor: Huawei Hardware Availability: Sep-2017
Tested by: Huawei Software Availability: Nov-2016
Benchmark results graph
Hardware
CPU Name: Intel Xeon Silver 4116
CPU Characteristics: Intel Turbo Boost Technology up to 3.00 GHz
CPU MHz: 2100
FPU: Integrated
CPU(s) enabled: 24 cores, 2 chips, 12 cores/chip, 2 threads/core
CPU(s) orderable: 1,2 chip
Primary Cache: 32 KB I + 32 KB D on chip per core
Secondary Cache: 1 MB I+D on chip per core
L3 Cache: 16.5 MB I+D on chip per chip
Other Cache: None
Memory: 384 GB (24 x 16 GB 2Rx8 PC4-2666V-R, running at
2400 MHz)
Disk Subsystem: 1 x 1200 GB SAS, 10000 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 7.3
(Maipo)
3.10.0-514.el7.x86_64
Compiler: C/C++: Version 17.0.0.098 of Intel C/C++
Compiler for Linux;
Fortran: Version 17.0.0.098 of Intel Fortran
Compiler for Linux
Auto Parallel: No
File System: xfs
System State: Run level 3 (multi-user)
Base Pointers: 32/64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 48 765 852 766 852 766 852 48 766 852 766 852 766 852
416.gamess 48 1142 823 1147 820 1141 824 48 1112 845 1102 853 1106 850
433.milc 48 516 855 515 855 516 854 48 516 855 515 855 516 854
434.zeusmp 48 371 1180 372 1170 373 1170 48 371 1180 372 1170 373 1170
435.gromacs 48 342 1000 344 998 341 1000 48 337 1020 336 1020 336 1020
436.cactusADM 48 461 1240 462 1240 462 1240 48 461 1240 462 1240 462 1240
437.leslie3d 48 726 621 725 622 729 619 48 731 617 725 623 726 622
444.namd 48 562 685 561 687 560 688 48 560 688 556 693 561 686
447.dealII 48 419 1310 419 1310 417 1320 48 419 1310 419 1310 417 1320
450.soplex 48 640 625 642 624 643 623 48 610 657 610 656 610 656
453.povray 48 228 1120 228 1120 228 1120 48 195 1310 197 1300 194 1310
454.calculix 48 313 1270 310 1280 313 1270 48 313 1270 310 1280 313 1270
459.GemsFDTD 48 866 588 864 589 864 589 48 864 589 866 588 863 590
465.tonto 48 527 897 537 880 535 883 48 494 955 500 946 498 949
470.lbm 48 577 1140 576 1140 576 1140 48 577 1140 576 1140 576 1140
481.wrf 48 530 1010 536 1000 539 995 48 530 1010 536 1000 539 995
482.sphinx3 48 1092 857 1090 858 1092 856 48 1092 857 1090 858 1092 856

Submit Notes

 The numactl mechanism was used to bind copies to processors. The config file option 'submit'
 was used to generate numactl commands to bind each copy to a specific processor.
 For details, please see the config file.

Operating System Notes

 Stack size set to unlimited using "ulimit -s unlimited"

Platform Notes

 BIOS configuration:
 Set Power Efficiency Mode to Performance
 Set Patrol Scrub to Disable
 Sysinfo program /spec17/config/sysinfo.rev6993
 Revision 6993 of 2015-11-06 (b5e8d4b4eb51ed28d7f98696cbe290c1)
 running on localhost.localdomain Fri Jul 21 15:32:53 2017

 This section contains SUT (System Under Test) info as seen by
 some common utilities.  To remove or add to this section, see:
   http://www.spec.org/cpu2006/Docs/config.html#sysinfo

 From /proc/cpuinfo
    model name : Intel(R) Xeon(R) Silver 4116 CPU @ 2.10GHz
       2 "physical id"s (chips)
       48 "processors"
    cores, siblings (Caution: counting these is hw and system dependent.  The
    following excerpts from /proc/cpuinfo might not be reliable.  Use with
    caution.)
       cpu cores : 12
       siblings  : 24
       physical 0: cores 0 1 2 3 4 5 8 9 10 11 12 13
       physical 1: cores 0 1 2 3 4 5 8 9 10 11 12 13
    cache size : 16896 KB

 From /proc/meminfo
    MemTotal:       394144692 kB
    HugePages_Total:       0
    Hugepagesize:       2048 kB

 From /etc/*release* /etc/*version*
    os-release:
       NAME="Red Hat Enterprise Linux Server"
       VERSION="7.3 (Maipo)"
       ID="rhel"
       ID_LIKE="fedora"
       VERSION_ID="7.3"
       PRETTY_NAME="Red Hat Enterprise Linux Server 7.3 (Maipo)"
       ANSI_COLOR="0;31"
       CPE_NAME="cpe:/o:redhat:enterprise_linux:7.3:GA:server"
    redhat-release: Red Hat Enterprise Linux Server release 7.3 (Maipo)
    system-release: Red Hat Enterprise Linux Server release 7.3 (Maipo)
    system-release-cpe: cpe:/o:redhat:enterprise_linux:7.3:ga:server

 uname -a:
    Linux localhost.localdomain 3.10.0-514.el7.x86_64 #1 SMP Wed Oct 19 11:24:13
    EDT 2016 x86_64 x86_64 x86_64 GNU/Linux

 run-level 3 Jul 19 11:54

 SPEC is set to: /spec17
    Filesystem     Type  Size  Used Avail Use% Mounted on
    /dev/sda2      xfs   262G   86G  176G  33% /
 Additional information from dmidecode:

    Warning: Use caution when you interpret this section. The 'dmidecode' program
    reads system data which is "intended to allow hardware to be accurately
    determined", but the intent may not be met, as there are frequent changes to
    hardware, firmware, and the "DMTF SMBIOS" standard.

   BIOS INSYDE Corp. 0.20 07/14/2017
   Memory:
    24x Samsung M393A2K43BB1-CTD 16 GB 2 rank 2666 MHz, configured at 2400 MHz

 (End of data from sysinfo program)

General Notes

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/spec17/libs/32:/spec17/libs/64:/spec17/sh10.2"

 Binaries compiled on a system with 1x Intel Core i7-4790 CPU + 32GB RAM
 memory using Redhat Enterprise Linux 7.2
 Transparent Huge Pages enabled with:
 echo always > /sys/kernel/mm/transparent_hugepage/enabled
 Filesystem page cache cleared with:
 echo 1>       /proc/sys/vm/drop_caches
 runspec command invoked through numactl i.e.:
 numactl --interleave=all runspec <etc>

Base Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks:

 icpc -m64 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -auto-p32   -qopt-mem-layout-trans=3 

C++ benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -auto-p32   -qopt-mem-layout-trans=3 

Fortran benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch 

Benchmarks using both Fortran and C:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -auto-p32   -qopt-mem-layout-trans=3 

Peak Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks (except as noted below):

 icpc -m64 
450.soplex:  icpc -m32 -L/opt/intel/compilers_and_libraries_2017/linux/lib/ia32 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -D_FILE_OFFSET_BITS=64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  basepeak = yes 
482.sphinx3:  basepeak = yes 

C++ benchmarks:

444.namd:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -fno-alias   -auto-ilp32   -qopt-mem-layout-trans=3 
447.dealII:  basepeak = yes 
450.soplex:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -qopt-malloc-options=3   -qopt-mem-layout-trans=3 
453.povray:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -unroll4   -qopt-mem-layout-trans=3 

Fortran benchmarks:

410.bwaves:  -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch 
416.gamess:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -unroll2   -inline-level=0   -scalar-rep- 
434.zeusmp:  basepeak = yes 
437.leslie3d:  Same as 410.bwaves 
459.GemsFDTD:  Same as 410.bwaves 
465.tonto:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -unroll4   -auto   -inline-calloc   -qopt-malloc-options=3 

Benchmarks using both Fortran and C:

435.gromacs:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -qopt-prefetch   -auto-ilp32   -qopt-mem-layout-trans=3 
436.cactusADM:  basepeak = yes 
454.calculix:  basepeak = yes 
481.wrf:  basepeak = yes 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/Intel-ic17.0-official-linux64.html,
http://www.spec.org/cpu2006/flags/Huawei-Platform-Settings-SKL-V1.6.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/Intel-ic17.0-official-linux64.xml,
http://www.spec.org/cpu2006/flags/Huawei-Platform-Settings-SKL-V1.6.xml.