SPEC CPU(R)2017 Floating Point Rate Result Inspur Corporation Inspur NF3180A6 (AMD EPYC 7713P) CPU2017 License: 3358 Test date: Jul-2021 Test sponsor: Inspur Corporation Hardware availability: Jun-2021 Tested by: Inspur Corporation Software availability: Mar-2021 Base Base Base Peak Peak Peak Benchmarks Copies Run Time Rate Copies Run Time Rate --------------- ------- --------- --------- ------- --------- --------- 503.bwaves_r 64 1714 374 * 16 402 399 S 503.bwaves_r 64 1714 375 S 16 403 399 * 507.cactuBSSN_r 64 224 362 S 64 225 360 * 507.cactuBSSN_r 64 224 362 * 64 224 362 S 508.namd_r 64 248 245 S 128 434 280 S 508.namd_r 64 248 245 * 128 434 280 * 510.parest_r 64 466 359 S 64 464 361 S 510.parest_r 64 471 355 * 64 467 359 * 511.povray_r 64 392 381 * 128 700 427 * 511.povray_r 64 392 382 S 128 699 428 S 519.lbm_r 64 741 91.0 * 32 369 91.4 * 519.lbm_r 64 741 91.0 S 32 369 91.4 S 521.wrf_r 64 564 254 S 32 260 276 * 521.wrf_r 64 565 254 * 32 259 276 S 526.blender_r 64 260 374 * 128 459 425 * 526.blender_r 64 260 375 S 128 459 425 S 527.cam4_r 64 307 365 * 128 604 371 * 527.cam4_r 64 306 365 S 128 603 371 S 538.imagick_r 64 116 1370 * 128 205 1550 * 538.imagick_r 64 116 1370 S 128 205 1550 S 544.nab_r 64 224 481 * 128 375 575 * 544.nab_r 64 222 485 S 128 375 575 S 549.fotonik3d_r 64 2016 124 S 32 990 126 S 549.fotonik3d_r 64 2016 124 * 32 990 126 * 554.roms_r 64 900 113 S 32 412 123 * 554.roms_r 64 901 113 * 32 411 124 S ================================================================================= 503.bwaves_r 64 1714 374 * 16 403 399 * 507.cactuBSSN_r 64 224 362 * 64 225 360 * 508.namd_r 64 248 245 * 128 434 280 * 510.parest_r 64 471 355 * 64 467 359 * 511.povray_r 64 392 381 * 128 700 427 * 519.lbm_r 64 741 91.0 * 32 369 91.4 * 521.wrf_r 64 565 254 * 32 260 276 * 526.blender_r 64 260 374 * 128 459 425 * 527.cam4_r 64 307 365 * 128 604 371 * 538.imagick_r 64 116 1370 * 128 205 1550 * 544.nab_r 64 224 481 * 128 375 575 * 549.fotonik3d_r 64 2016 124 * 32 990 126 * 554.roms_r 64 901 113 * 32 412 123 * SPECrate(R)2017_fp_base 295 SPECrate(R)2017_fp_peak 318 HARDWARE -------- CPU Name: AMD EPYC 7713P Max MHz: 3675 Nominal: 2000 Enabled: 64 cores, 1 chip, 2 threads/core Orderable: 1 chip Cache L1: 32 KB I + 32 KB D on chip per core L2: 512 KB I+D on chip per core L3: 256 MB I+D on chip per chip, 32 MB shared / 8 cores Other: None Memory: 512 GB (16 x 32 GB 2Rx4 PC4-3200AA-R) Storage: 1 x 480 GB SATA SSD Other: None SOFTWARE -------- OS: Red Hat Enterprise Linux release 8.2 (Ootpa) 4.18.0-193.el8.x86_64 Compiler: C/C++/Fortran: Version 3.0.0 of AOCC Parallel: No Firmware: Version 1.6.1 released Jun-2021 File System: xfs System State: Run level 3 (multi-user) Base Pointers: 64-bit Peak Pointers: 64-bit Other: jemalloc: jemalloc memory allocator library v5.2.0 Power Management: BIOS and OS set to prefer performance at the cost of additional power usage. Compiler Notes -------------- The AMD64 AOCC Compiler Suite is available at http://developer.amd.com/amd-aocc/ Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details. Operating System Notes ---------------------- 'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit runcpu command invoked through numactl i.e.: numactl --interleave=all runcpu Set dirty_ratio=8 to limit dirty cache to 8% of memory Set swappiness=1 to swap only if necessary Set zone_reclaim_mode=1 to free local node memory and avoid remote memory sync then drop_caches=3 to reset caches before invoking runcpu ASLR is disabled to reduce run-to-run issues. dirty_ratio, swappiness, zone_reclaim_mode, drop_caches and ASLR were all set using privileged echo (e.g. echo 1 > /proc/sys/vm/swappiness). Transparent huge pages set to 'always' for Int Peak, FP Base/Peak (OS default) Transparent huge pages set to 'madvise' for Int Base Environment Variables Notes --------------------------- Environment variables set by runcpu before the start of the run: LD_LIBRARY_PATH = "/home/CPU2017/amd_rate_aocc300_milan_A_lib/64;/home/CPU2017/amd_rate_ao cc300_milan_A_lib/32:" MALLOC_CONF = "retain:true" General Notes ------------- Binaries were compiled on a system with 2x AMD EPYC 7742 CPU + 512GiB Memory using OpenSUSE 15.2 NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown) is mitigated in the system as tested and documented. Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1) is mitigated in the system as tested and documented. Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2) is mitigated in the system as tested and documented. jemalloc: configured and built with GCC v9.1.0 in Ubuntu 19.04 with -O3 -znver2 -flto jemalloc 5.2.0 is available here: https://github.com/jemalloc/jemalloc/releases/download/5.2.0/jemalloc-5.2.0.tar.bz2 Platform Notes -------------- BIOS configuration: NUMA nodes per socket set to NPS4 DRAM Scrub time set to Disable cTDP Control set to Manual cTDP set to 240 APBDIS set to 1 Sysinfo program /home/CPU2017/bin/sysinfo Rev: r6622 of 2021-04-07 982a61ec0915b55891ef0e16acafc64d running on localhost.localdomain Wed Jan 1 11:06:04 2020 SUT (System Under Test) info as seen by some common utilities. For more information on this section, see https://www.spec.org/cpu2017/Docs/config.html#sysinfo From /proc/cpuinfo model name : AMD EPYC 7713P 64-Core Processor 1 "physical id"s (chips) 128 "processors" cores, siblings (Caution: counting these is hw and system dependent. The following excerpts from /proc/cpuinfo might not be reliable. Use with caution.) cpu cores : 64 siblings : 128 physical 0: cores 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 From lscpu from util-linux 2.32.1: Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Byte Order: Little Endian CPU(s): 128 On-line CPU(s) list: 0-127 Thread(s) per core: 2 Core(s) per socket: 64 Socket(s): 1 NUMA node(s): 4 Vendor ID: AuthenticAMD CPU family: 25 Model: 1 Model name: AMD EPYC 7713P 64-Core Processor Stepping: 1 CPU MHz: 2388.998 CPU max MHz: 2000.0000 CPU min MHz: 1500.0000 BogoMIPS: 3992.65 Virtualization: AMD-V L1d cache: 32K L1i cache: 32K L2 cache: 512K L3 cache: 32768K NUMA node0 CPU(s): 0-15,64-79 NUMA node1 CPU(s): 16-31,80-95 NUMA node2 CPU(s): 32-47,96-111 NUMA node3 CPU(s): 48-63,112-127 Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 invpcid_single hw_pstate sme ssbd mba sev ibrs ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr wbnoinvd arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload vgif umip pku ospke vaes vpclmulqdq rdpid overflow_recov succor smca /proc/cpuinfo cache data cache size : 512 KB From numactl --hardware WARNING: a numactl 'node' might or might not correspond to a physical chip. available: 4 nodes (0-3) node 0 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 node 0 size: 128831 MB node 0 free: 128400 MB node 1 cpus: 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 node 1 size: 128987 MB node 1 free: 128471 MB node 2 cpus: 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 node 2 size: 129014 MB node 2 free: 128652 MB node 3 cpus: 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 node 3 size: 129003 MB node 3 free: 128639 MB node distances: node 0 1 2 3 0: 10 12 12 12 1: 12 10 12 12 2: 12 12 10 12 3: 12 12 12 10 From /proc/meminfo MemTotal: 528217328 kB HugePages_Total: 0 Hugepagesize: 2048 kB /sbin/tuned-adm active It seems that tuned daemon is not running, preset profile is not activated. Preset profile: throughput-performance /sys/devices/system/cpu/cpu*/cpufreq/scaling_governor has performance From /etc/*release* /etc/*version* os-release: NAME="Red Hat Enterprise Linux" VERSION="8.2 (Ootpa)" ID="rhel" ID_LIKE="fedora" VERSION_ID="8.2" PLATFORM_ID="platform:el8" PRETTY_NAME="Red Hat Enterprise Linux 8.2 (Ootpa)" ANSI_COLOR="0;31" redhat-release: Red Hat Enterprise Linux release 8.2 (Ootpa) system-release: Red Hat Enterprise Linux release 8.2 (Ootpa) system-release-cpe: cpe:/o:redhat:enterprise_linux:8.2:ga uname -a: Linux localhost.localdomain 4.18.0-193.el8.x86_64 #1 SMP Fri Mar 27 14:35:58 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux Kernel self-reported vulnerability status: CVE-2018-12207 (iTLB Multihit): Not affected CVE-2018-3620 (L1 Terminal Fault): Not affected Microarchitectural Data Sampling: Not affected CVE-2017-5754 (Meltdown): Not affected CVE-2018-3639 (Speculative Store Bypass): Mitigation: Speculative Store Bypass disabled via prctl and seccomp CVE-2017-5753 (Spectre variant 1): Mitigation: usercopy/swapgs barriers and __user pointer sanitization CVE-2017-5715 (Spectre variant 2): Mitigation: Full AMD retpoline, IBPB: conditional, IBRS_FW, STIBP: conditional, RSB filling CVE-2020-0543 (Special Register Buffer Data Sampling): No status reported CVE-2019-11135 (TSX Asynchronous Abort): Not affected run-level 3 Jan 1 04:28 SPEC is set to: /home/CPU2017 Filesystem Type Size Used Avail Use% Mounted on /dev/mapper/rhel-home xfs 392G 8.4G 383G 3% /home From /sys/devices/virtual/dmi/id Vendor: Inspur Product: IIMS Product Family: Not specified Serial: produceSN Additional information from dmidecode 3.2 follows. WARNING: Use caution when you interpret this section. The 'dmidecode' program reads system data which is "intended to allow hardware to be accurately determined", but the intent may not be met, as there are frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard. Memory: 16x Micron Technology 36ASF4G72PZ-3G2E7 32 GB 2 rank 3200 BIOS: BIOS Vendor: BIOS_VENDOR BIOS Version: 1.6.1 BIOS Date: 06/04/2021 BIOS Revision: 5.22 (End of data from sysinfo program) Compiler Version Notes ---------------------- ============================================================================== C | 519.lbm_r(base, peak) 538.imagick_r(base, peak) | 544.nab_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin ------------------------------------------------------------------------------ ============================================================================== C++ | 508.namd_r(base, peak) 510.parest_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin ------------------------------------------------------------------------------ ============================================================================== C++, C | 511.povray_r(base, peak) 526.blender_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin ------------------------------------------------------------------------------ ============================================================================== C++, C, Fortran | 507.cactuBSSN_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin ------------------------------------------------------------------------------ ============================================================================== Fortran | 503.bwaves_r(base, peak) 549.fotonik3d_r(base, peak) | 554.roms_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin ------------------------------------------------------------------------------ ============================================================================== Fortran, C | 521.wrf_r(base, peak) 527.cam4_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on LLVM Mirror.Version.12.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin ------------------------------------------------------------------------------ Base Compiler Invocation ------------------------ C benchmarks: clang C++ benchmarks: clang++ Fortran benchmarks: flang Benchmarks using both Fortran and C: flang clang Benchmarks using both C and C++: clang++ clang Benchmarks using Fortran, C, and C++: clang++ clang flang Base Portability Flags ---------------------- 503.bwaves_r: -DSPEC_LP64 507.cactuBSSN_r: -DSPEC_LP64 508.namd_r: -DSPEC_LP64 510.parest_r: -DSPEC_LP64 511.povray_r: -DSPEC_LP64 519.lbm_r: -DSPEC_LP64 521.wrf_r: -DSPEC_CASE_FLAG -Mbyteswapio -DSPEC_LP64 526.blender_r: -funsigned-char -D__BOOL_DEFINED -DSPEC_LP64 527.cam4_r: -DSPEC_CASE_FLAG -DSPEC_LP64 538.imagick_r: -DSPEC_LP64 544.nab_r: -DSPEC_LP64 549.fotonik3d_r: -DSPEC_LP64 554.roms_r: -DSPEC_LP64 Base Optimization Flags ----------------------- C benchmarks: -m64 -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math -march=znver3 -fveclib=AMDLIBM -fstruct-layout=5 -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000 -fremap-arrays -mllvm -function-specialize -flv-function-specialization -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -z muldefs -lamdlibm -ljemalloc -lflang -lflangrti C++ benchmarks: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math -march=znver3 -fveclib=AMDLIBM -mllvm -enable-partial-unswitch -mllvm -unroll-threshold=100 -finline-aggressive -flv-function-specialization -mllvm -loop-unswitch-threshold=200000 -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -mllvm -convert-pow-exp-to-int=false -z muldefs -lamdlibm -ljemalloc -lflang -lflangrti Fortran benchmarks: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Hz,1,0x1 -O3 -ffast-math -march=znver3 -fveclib=AMDLIBM -Kieee -Mrecursive -mllvm -fuse-tile-inner-loop -funroll-loops -mllvm -extra-vectorizer-passes -mllvm -lsr-in-nested-loop -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -z muldefs -lamdlibm -ljemalloc -lflang -lflangrti Benchmarks using both Fortran and C: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math -march=znver3 -fveclib=AMDLIBM -fstruct-layout=5 -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000 -fremap-arrays -mllvm -function-specialize -flv-function-specialization -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -Hz,1,0x1 -Kieee -Mrecursive -mllvm -fuse-tile-inner-loop -funroll-loops -mllvm -extra-vectorizer-passes -mllvm -lsr-in-nested-loop -z muldefs -lamdlibm -ljemalloc -lflang -lflangrti Benchmarks using both C and C++: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math -march=znver3 -fveclib=AMDLIBM -fstruct-layout=5 -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000 -fremap-arrays -mllvm -function-specialize -flv-function-specialization -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -enable-partial-unswitch -mllvm -unroll-threshold=100 -finline-aggressive -mllvm -loop-unswitch-threshold=200000 -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes -mllvm -convert-pow-exp-to-int=false -z muldefs -lamdlibm -ljemalloc -lflang -lflangrti Benchmarks using Fortran, C, and C++: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math -march=znver3 -fveclib=AMDLIBM -fstruct-layout=5 -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000 -fremap-arrays -mllvm -function-specialize -flv-function-specialization -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -enable-partial-unswitch -mllvm -unroll-threshold=100 -finline-aggressive -mllvm -loop-unswitch-threshold=200000 -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes -mllvm -convert-pow-exp-to-int=false -Hz,1,0x1 -Kieee -Mrecursive -mllvm -fuse-tile-inner-loop -funroll-loops -mllvm -lsr-in-nested-loop -z muldefs -lamdlibm -ljemalloc -lflang -lflangrti Base Other Flags ---------------- C benchmarks: -Wno-unused-command-line-argument C++ benchmarks: -Wno-unused-command-line-argument Fortran benchmarks: -Wno-unused-command-line-argument Benchmarks using both Fortran and C: -Wno-unused-command-line-argument Benchmarks using both C and C++: -Wno-unused-command-line-argument Benchmarks using Fortran, C, and C++: -Wno-unused-command-line-argument Peak Compiler Invocation ------------------------ C benchmarks: clang C++ benchmarks: clang++ Fortran benchmarks: flang Benchmarks using both Fortran and C: flang clang Benchmarks using both C and C++: clang++ clang Benchmarks using Fortran, C, and C++: clang++ clang flang Peak Portability Flags ---------------------- Same as Base Portability Flags Peak Optimization Flags ----------------------- C benchmarks: 519.lbm_r: -m64 -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -lamdlibm -ljemalloc 538.imagick_r: Same as 519.lbm_r 544.nab_r: -m64 -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Ofast -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -lamdlibm -ljemalloc C++ benchmarks: 508.namd_r: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -finline-aggressive -mllvm -unroll-threshold=100 -flv-function-specialization -mllvm -enable-licm-vrp -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -lamdlibm -ljemalloc 510.parest_r: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-suppress-fmas -Wl,-mllvm -Wl,-function-specialize -Ofast -march=znver3 -fveclib=AMDLIBM -finline-aggressive -mllvm -unroll-threshold=100 -flv-function-specialization -mllvm -enable-licm-vrp -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -lamdlibm -ljemalloc Fortran benchmarks: 503.bwaves_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -Kieee -Mrecursive -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -lamdlibm -ljemalloc -lflang -lflangrti 549.fotonik3d_r: Same as 503.bwaves_r 554.roms_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -Kieee -Mrecursive -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -Hz,1,0x1 -mllvm -fuse-tile-inner-loop -lamdlibm -ljemalloc -lflang -lflangrti Benchmarks using both Fortran and C: 521.wrf_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -Kieee -Mrecursive -lamdlibm -ljemalloc -lflang -lflangrti 527.cam4_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-force-vector-interleave=1 -Ofast -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -O3 -ffast-math -funroll-loops -mllvm -extra-vectorizer-passes -mllvm -lsr-in-nested-loop -Mrecursive -lamdlibm -ljemalloc -lflang -lflangrti Benchmarks using both C and C++: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -finline-aggressive -mllvm -unroll-threshold=100 -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -lamdlibm -ljemalloc Benchmarks using Fortran, C, and C++: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -unroll-threshold=100 -mllvm -loop-unswitch-threshold=200000 -finline-aggressive -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes -mllvm -convert-pow-exp-to-int=false -Kieee -Mrecursive -lamdlibm -ljemalloc -lflang -lflangrti Peak Other Flags ---------------- C benchmarks: -Wno-unused-command-line-argument C++ benchmarks: -Wno-unused-command-line-argument Fortran benchmarks: -Wno-unused-command-line-argument Benchmarks using both Fortran and C: -Wno-unused-command-line-argument Benchmarks using both C and C++: -Wno-unused-command-line-argument Benchmarks using Fortran, C, and C++: -Wno-unused-command-line-argument The flags files that were used to format this result can be browsed at http://www.spec.org/cpu2017/flags/aocc300-flags-A1.html http://www.spec.org/cpu2017/flags/Inspur-Platform-Settings-amd-V1.0.html You can also download the XML flags sources by saving the following links: http://www.spec.org/cpu2017/flags/aocc300-flags-A1.xml http://www.spec.org/cpu2017/flags/Inspur-Platform-Settings-amd-V1.0.xml SPEC CPU and SPECrate are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ---------------------------------------------------------------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact info@spec.org. Copyright 2017-2021 Standard Performance Evaluation Corporation Tested with SPEC CPU(R)2017 v1.1.8 on 2020-01-01 11:06:04-0500. Report generated on 2021-08-04 18:49:41 by CPU2017 text formatter v6255. Originally published on 2021-08-03.