Intel Hyper-Threading Technology (SMT): Enabled
Intel Turbo Boost Technology up to 3.33 GHz
ulimit -s unlimited
Removes limits on the maximum size of the automatically-
extended stack region of the current process and each
process it creates.
Compiler flags for base level optimization
COPTIMIZE : -O3 -xSSE4.2 -ipo -no-prec-div -unroll-loops0 -openmp
FOPTIMIZE : -O3 -xSSE4.2 -ipo -no-prec-div -unroll-loops0 -openmp
F77OPTIMIZE : -O3 -xSSE4.2 -ipo -no-prec-div -unroll-loops0 -openmp
Environment:
KMP_AFFINITY=disabled
Controls the binding of OpenMP threads to the physical processing units.
KMP_SCHEDULE=static,balanced
Used to fine-tune the load balancing of parallel loops that are
statically scheduled under OpenMP with no chunk size specification.
KMP_BLOCKTIME=infinite
Sets the time, in milliseconds, that a thread should wait,
after completing the execution of a parallel region, before sleeping.
KMP_LIBRARY=throughput
Selects the OpenMP run-time library
KMP_STACKSIZE=31m
Sets the number of bytes to allocate for each parallel thread to use as
its private stack.
OMP_NESTED=TRUE
Enables (TRUE) or disables (FALSE) nested parallelism.
OMP_DYNAMIC=FALSE
Enables (TRUE) or disables (FALSE) the dynamic adjustment of the number of threads.
OMP_NUM_THREADS=16
Sets the maximum number of threads to use for OpenMP parallel
regions if no other value is specified in the program itself.
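The settings listed above could be applied in a POSIX shell before launching a run, roughly as sketched below. This is an illustration only: the result file does not show how the variables were actually set, and the fallback message on the ulimit line is an addition for environments where the hard limit forbids an unlimited stack.

```shell
#!/bin/sh
# Sketch: reproduce the stack limit and OpenMP environment listed in these notes.
# How the original run applied them is not stated; this is one plausible way.

# Remove the stack size limit; warn instead of aborting if the hard limit forbids it.
ulimit -s unlimited 2>/dev/null || echo "warning: could not raise stack limit" >&2

export KMP_AFFINITY=disabled          # no binding of OpenMP threads to processing units
export KMP_SCHEDULE=static,balanced   # load balancing for statically scheduled loops
export KMP_BLOCKTIME=infinite         # threads never sleep after a parallel region
export KMP_LIBRARY=throughput         # OpenMP run-time library mode
export KMP_STACKSIZE=31m              # private stack size per parallel thread
export OMP_NESTED=TRUE                # nested parallelism enabled
export OMP_DYNAMIC=FALSE              # no dynamic adjustment of thread count
export OMP_NUM_THREADS=16             # default thread count for parallel regions

echo "OMP_NUM_THREADS=$OMP_NUM_THREADS"
```

Benchmarks that list ENV_OMP_NUM_THREADS in their peak flags override the thread count for that benchmark only.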
Flags for peak level optimization
310.wupwise_m peak flags:
fdo_pre0 = rm -rf ./*.dyn
PASS1_FFLAGS = -prof-gen
PASS2_FFLAGS = -prof-use
PASS1_LDFLAGS = -prof-gen
PASS2_LDFLAGS = -prof-use
312.swim_m peak flags:
fdo_pre0 = rm -rf ./*.dyn
PASS1_FFLAGS = -prof-gen
PASS2_FFLAGS = -prof-use
PASS1_LDFLAGS = -prof-gen
PASS2_LDFLAGS = -prof-use
316.applu_m peak flags:
COPTIMIZE = -O2 -xSSE4.2 -ipo -no-prec-div -openmp
FOPTIMIZE = -O2 -xSSE4.2 -ipo -no-prec-div -openmp
F77OPTIMIZE = -O2 -xSSE4.2 -ipo -no-prec-div -openmp
318.galgel_m peak flags:
ENV_OMP_NUM_THREADS=8
320.equake_m peak flags:
ENV_OMP_NUM_THREADS=8
srcalt:ompl.32
324.apsi_m peak flags:
COPTIMIZE = -O2 -xSSE4.2 -ipo -no-prec-div -openmp
FOPTIMIZE = -O2 -xSSE4.2 -ipo -no-prec-div -openmp
F77OPTIMIZE = -O2 -xSSE4.2 -ipo -no-prec-div -openmp
328.fma3d_m peak flags:
srcalt:ompl.32
fdo_pre0 = rm -rf ./*.dyn
PASS1_FFLAGS = -prof-gen
PASS2_FFLAGS = -prof-use
PASS1_LDFLAGS = -prof-gen
PASS2_LDFLAGS = -prof-use
330.art_m peak flags:
COPTIMIZE = -O2 -xSSE4.2 -ipo -no-prec-div -openmp
FOPTIMIZE = -O2 -xSSE4.2 -ipo -no-prec-div -openmp
F77OPTIMIZE = -O2 -xSSE4.2 -ipo -no-prec-div -openmp
332.ammp_m peak flags:
COPTIMIZE = -O2 -xSSE4.2 -openmp
FOPTIMIZE = -O2 -xSSE4.2 -openmp
F77OPTIMIZE = -O2 -xSSE4.2 -openmp
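The fdo_pre0, PASS1 and PASS2 entries listed for 310.wupwise_m, 312.swim_m and 328.fma3d_m describe a two-pass feedback-directed build. The dry-run sketch below prints the sequence of commands rather than executing them, so it does not need the compiler installed. The names FC, SRC and EXE are hypothetical placeholders; the real builds are driven by the benchmark tools and also carry the other optimization flags in effect, which are omitted here.

```shell
#!/bin/sh
# Dry-run sketch of the two-pass FDO sequence; prints commands instead of running them.
# FC, SRC and EXE are hypothetical placeholders, not taken from the result file.
FC=ifort
SRC=example.f
EXE=./example

fdo_plan() {
    echo "rm -rf ./*.dyn"                 # fdo_pre0: discard stale profile data
    echo "$FC -prof-gen $SRC -o $EXE"     # pass 1: build an instrumented binary
    echo "$EXE"                           # training run writes .dyn profile files
    echo "$FC -prof-use $SRC -o $EXE"     # pass 2: rebuild using the collected profile
}

fdo_plan
```

Clearing the .dyn files first keeps profile data from an earlier build from being merged into the second pass.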
For a description of the flags used, please see
Intel-ic11.1-intel64-linux-flags-file.html in the flags directory
The benchmark suite was installed and run on a RAM based
filesystem. No physical hard drives were used. The Linux
default tmpfs file system type was used.