                           SPEC(R) MPIL2007 Summary
                              Intel Corporation
       Intel Server System R2208WFTZS (Intel Xeon Gold 6148, 2.40 GHz)
                           Sun Jul 23 16:39:18 2017

MPI2007 License: 13                                  Test date: Jul-2017
Test sponsor: Intel Corporation          Hardware availability: Jul-2017
Tested by:    Intel Corporation          Software availability: Sep-2017

                 Base       Base       Base    Peak       Peak       Peak
Benchmarks      Ranks   Run Time      Ratio   Ranks   Run Time      Ratio
-------------- ------  ---------  ---------  ------  ---------  ---------
121.pop2          320        235       16.5 S
121.pop2          320        237       16.4 *
121.pop2          320        238       16.3 S
122.tachyon       320        202       9.62 S
122.tachyon       320        200       9.72 S
122.tachyon       320        200       9.71 *
125.RAxML         320        193       15.1 *
125.RAxML         320        193       15.1 S
125.RAxML         320        193       15.1 S
126.lammps        320        191       12.9 S
126.lammps        320        192       12.8 S
126.lammps        320        191       12.9 *
128.GAPgeofem     320        182       32.5 S
128.GAPgeofem     320        184       32.3 S
128.GAPgeofem     320        183       32.5 *
129.tera_tf       320        115       9.59 S
129.tera_tf       320        116       9.51 *
129.tera_tf       320        116       9.48 S
132.zeusmp2       320        118       18.0 *
132.zeusmp2       320        118       17.9 S
132.zeusmp2       320        117       18.1 S
137.lu            320        117       35.8 S
137.lu            320        117       35.8 *
137.lu            320        117       35.9 S
142.dmilc         320        136       27.1 S
142.dmilc         320        136       27.1 *
142.dmilc         320        136       27.2 S
143.dleslie       320        107       29.0 S
143.dleslie       320        108       28.7 S
143.dleslie       320        107       28.9 *
145.lGemsFDTD     320        218       20.2 S
145.lGemsFDTD     320        219       20.2 S
145.lGemsFDTD     320        218       20.2 *
147.l2wrf2        320        386       21.3 *
147.l2wrf2        320        386       21.2 S
147.l2wrf2        320        385       21.3 S
==============================================================================
121.pop2          320        237       16.4 *
122.tachyon       320        200       9.71 *
125.RAxML         320        193       15.1 *
126.lammps        320        191       12.9 *
128.GAPgeofem     320        183       32.5 *
129.tera_tf       320        116       9.51 *
132.zeusmp2       320        118       18.0 *
137.lu            320        117       35.8 *
142.dmilc         320        136       27.1 *
143.dleslie       320        107       28.9 *
145.lGemsFDTD     320        218       20.2 *
147.l2wrf2        320        386       21.3 *
SPECmpiL_base2007                      18.9
SPECmpiL_peak2007                   Not Run


                               BENCHMARK DETAILS
                               -----------------
  Type of System:       Homogeneous
  Total Compute Nodes:  8
  Total Chips:          16
  Total Cores:          320
  Total Threads:        640
  Total Memory:         1536 GB
  Base Ranks Run:       320
  Minimum Peak Ranks:   --
  Maximum Peak Ranks:   --
  C Compiler:           Intel C++ Composer XE 2017 for Linux
                        Version 17.0.4.196 Build 20170411
  C++ Compiler:         Intel C++ Composer XE 2017 for Linux
                        Version 17.0.4.196 Build 20170411
  Fortran Compiler:     Intel Fortran Composer XE 2017 for Linux
                        Version 17.0.4.196 Build 20170411
  Base Pointers:        64-bit
  Peak Pointers:        Not Applicable
  MPI Library:          Intel MPI Library 17u4 for Linux
  Other MPI Info:       None
  Pre-processors:       No
  Other Software:       None


                        Node Description: Endeavor Node
                        ===============================

                                   HARDWARE
                                   --------
  Number of nodes:      8
  Uses of the node:     compute
  Vendor:               Intel
  Model:                Intel Server System R2208WFTZS (Intel Xeon Gold 6148,
                        2.4 GHz)
  CPU Name:             Intel Xeon Gold 6148
  CPU(s) orderable:     1-2 chips
  Chips enabled:        2
  Cores enabled:        40
  Cores per chip:       20
  Threads per core:     2
  CPU Characteristics:  Intel Turbo Boost Technology up to 3.7 GHz
  CPU MHz:              2400
  Primary Cache:        32 KB I + 32 KB D on chip per core
  Secondary Cache:      1 MB I+D on chip per core
  L3 Cache:             27.5 MB I+D on chip per chip
  Other Cache:          None
  Memory:               192 GB (12 x 16 GB 2Rx4 DDR4-2666 ECC Registered)
  Disk Subsystem:       1 x 800 GB SSD (INTEL SSDSC2BA80)
  Other Hardware:       None
  Adapter:              Intel Omni-Path Fabric Adapter 100 series
  Number of Adapters:   1
  Slot Type:            PCI-Express x16
  Data Rate:            12.5 GB/s
  Ports Used:           1
  Interconnect Type:    Intel Omni-Path Fabric Adapter 100 series
  Adapter:              Intel Omni-Path Edge Switch 100 series
  Number of Adapters:   1
  Slot Type:            PCI-Express x16
  Data Rate:            12.5 GB/s
  Ports Used:           1
  Interconnect Type:    Intel Omni-Path Fabric Adapter 100 series

                                   SOFTWARE
                                   --------
  Adapter:              Intel Omni-Path Fabric Adapter 100 series
  Adapter Driver:       IFS 10.4
  Adapter Firmware:     0.9-46
  Adapter:              Intel Omni-Path Edge Switch 100 series
  Adapter Driver:       IFS 10.4
  Adapter Firmware:     0.9-46
  Operating System:     Oracle Linux Server release 7.3,
                        Kernel 3.10.0-514.6.2.0.1.el7.x86_64.knl1
  Local File System:    Linux/xfs
  Shared File System:   LFS
  System State:         Multi-User
  Other Software:       IBM Platform LSF Standard 9.1.1.1


                          Node Description: Lustre FS
                          ===========================

                                   HARDWARE
                                   --------
  Number of nodes:      11
  Uses of the node:     fileserver
  Vendor:               Intel
  Model:                Intel Server System R2224GZ4GC4
  CPU Name:             Intel Xeon E5-2680
  CPU(s) orderable:     1-2 chips
  Chips enabled:        2
  Cores enabled:        16
  Cores per chip:       8
  Threads per core:     2
  CPU Characteristics:  Intel Turbo Boost Technology disabled
  CPU MHz:              2700
  Primary Cache:        32 KB I + 32 KB D on chip per core
  Secondary Cache:      2 MB I+D on chip per chip
  L3 Cache:             20 MB I+D on chip per chip
  Other Cache:          None
  Memory:               64 GB (8 x 8GB 1600MHz Reg ECC DDR3)
  Disk Subsystem:       2.1 TB
  Other Hardware:       None
  Adapter:              Intel Omni-Path Fabric Adapter 100 series
  Number of Adapters:   1
  Slot Type:            PCI-Express x16
  Data Rate:            12.5 GB/s
  Ports Used:           1
  Interconnect Type:    Intel Omni-Path Fabric Adapter 100 series

                                   SOFTWARE
                                   --------
  Adapter:              Intel Omni-Path Fabric Adapter 100 series
  Adapter Driver:       IFS 10.4
  Adapter Firmware:     0.9-46
  Operating System:     Redhat* Enterprise Linux* Server Release 7.2,
                        Kernel 3.10.0-514.6.2.0.1.el7.x86_64.knl1
  Local File System:    None
  Shared File System:   Lustre FS
  System State:         Multi-User
  Other Software:       None


                   Interconnect Description: Intel Omni-Path
                   =========================================

                                   HARDWARE
                                   --------
  Vendor:               Intel
  Model:                Intel Omni-Path 100 series
  Switch Model:         Intel Omni-Path Edge Switch 100 series
  Number of Switches:   24
  Number of Ports:      48
  Data Rate:            12.5 GB/s
  Firmware:             0.9-46
  Topology:             Fat tree
  Primary Use:          MPI traffic


                   Interconnect Description: Intel Omni-Path
                   =========================================

                                   HARDWARE
                                   --------
  Vendor:               Intel Corporation
  Model:                Intel Omni-Path 100 series
  Switch Model:         Intel Omni-Path Edge Switch 100 series
  Number of Switches:   1
  Number of Ports:      48
  Data Rate:            12.5 GB/s
  Firmware:             0.9-46
  Topology:             Fat tree
  Primary Use:          Cluster File System


                                 Submit Notes
                                 ------------
    The config file option 'submit' was used.


                                 General Notes
                                 -------------
    MPI startup command:
      mpiexec.hydra command was used to start MPI jobs.

    Software environment:
      export I_MPI_COMPATIBILITY=3
      export I_MPI_FABRICS=shm:tmi
      export I_MPI_HYDRA_PMI_CONNECT=alltoall

    Network:
      Endeavor Omni-Path fabric consists of 48-port switches = 24 core
      switches connected to each leaf of the rack switch.

    Job placement:
      Each MPI job was assigned to a topologically compact set of nodes,
      i.e. the minimal needed number of leaf switches was used for each
      job = 1 switch for 40/80/160/320/640 ranks, 2 switches for 1280 and
      1980 ranks.

    IBM Platform LSF was used for job submission. It has no impact on
    performance. Information can be found at: http://www.ibm.com


                           Base Compiler Invocation
                           ------------------------
  C benchmarks:
      mpiicc

  C++ benchmarks:
    126.lammps: mpiicpc

  Fortran benchmarks:
      mpiifort

  Benchmarks using both Fortran and C:
      mpiicc mpiifort


                            Base Portability Flags
                            ----------------------
    121.pop2:   -DSPEC_MPI_CASE_FLAG
    126.lammps: -DMPICH_IGNORE_CXX_SEEK


                            Base Optimization Flags
                            -----------------------
  C benchmarks:
      -O3 -xCORE-AVX512 -no-prec-div -ipo

  C++ benchmarks:
    126.lammps: -O3 -xCORE-AVX512 -no-prec-div -ipo

  Fortran benchmarks:
      -O3 -xCORE-AVX512 -no-prec-div -ipo

  Benchmarks using both Fortran and C:
      -O3 -xCORE-AVX512 -no-prec-div -ipo


The flags file that was used to format this result can be browsed at
  http://www.spec.org/mpi2007/flags/EM64T_Intel140_flags.20170822.html

You can also download the XML flags source by saving the following link:
  http://www.spec.org/mpi2007/flags/EM64T_Intel140_flags.20170822.xml


SPEC and SPEC MPI are registered trademarks of the Standard Performance
Evaluation Corporation. All other brand and product names appearing in
this result are trademarks or registered trademarks of their respective
holders.
-----------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2006-2010 Standard Performance Evaluation Corporation
Tested with SPEC MPI2007 v2.0.1.
Report generated on Tue Aug 22 18:38:27 2017 by MPI2007 ASCII formatter v1463.
Originally published on 22 August 2017.
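
Reader's note (not part of the original result): the sketch below shows how the SPECmpiL_base2007 figure can be cross-checked from the table. It assumes that the run marked `*` for each benchmark is the median of the three base runs and that the overall metric is the geometric mean of those medians; the ratios are copied from the table, and the variable names (`base_ratios`, `medians`, `overall`) are illustrative only. Because the published ratios are rounded to three significant digits, the recomputed value can only be expected to agree to about that precision.

```python
import math
import statistics

# Base ratios for the three runs of each benchmark, copied from the table.
base_ratios = {
    "121.pop2":      [16.5, 16.4, 16.3],
    "122.tachyon":   [9.62, 9.72, 9.71],
    "125.RAxML":     [15.1, 15.1, 15.1],
    "126.lammps":    [12.9, 12.8, 12.9],
    "128.GAPgeofem": [32.5, 32.3, 32.5],
    "129.tera_tf":   [9.59, 9.51, 9.48],
    "132.zeusmp2":   [18.0, 17.9, 18.1],
    "137.lu":        [35.8, 35.8, 35.9],
    "142.dmilc":     [27.1, 27.1, 27.2],
    "143.dleslie":   [29.0, 28.7, 28.9],
    "145.lGemsFDTD": [20.2, 20.2, 20.2],
    "147.l2wrf2":    [21.3, 21.2, 21.3],
}

# Assumption: the reported per-benchmark ratio is the median of the runs.
medians = {name: statistics.median(runs) for name, runs in base_ratios.items()}

# Assumption: the overall metric is the geometric mean of the medians,
# computed here as exp(mean(log(x))).
overall = math.exp(sum(math.log(r) for r in medians.values()) / len(medians))

for name, ratio in medians.items():
    print(f"{name:<14} {ratio:>6}")
print(f"geometric mean of medians: {overall:.1f}")
```

Under these assumptions the medians reproduce the twelve selected rows below the `=====` separator, and the geometric mean rounds to the reported 18.9.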