AMD Turbo CORE technology up to 3.30 GHz
Base and Peak User Environment (bash):
export OMP_NUM_THREADS=32
export OMP_DYNAMIC=FALSE
export O64_OMP_SPIN_COUNT=1000000000
export O64_OMP_SPIN_USER_LOCK=TRUE
export OMP_SLAVE_STACK=22M
export O64_OMP_SET_AFFINITY=TRUE
export O64_OMP_AFFINITY_MAP=0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31
ulimit -s unlimited
For Compiler/OpenMP flags description please refer:
SGI-x86-open64-424-flags.html
Base tuning:
C : opencc -mp -march=bdver1 -Ofast -mcmodel=medium
Fortran : openf90 -mp -march=bdver1 -Ofast -OPT:early_mp=on -mcmodel=medium
ONESTEP=yes
Extra art Base Flags
330.art_m : -DINTS_PER_CACHELINE=16 -DDBLS_PER_CACHELINE=8
Portablility flags:
318.galgel_m : -fixedform -col80
Peak tuning:
ONESTEP=yes for all benchmarks.
310.wupwise_m : FOPTIMIZE = -mp -Ofast -msse3 -OPT:early_mp=on -LNO:prefetch_ahead=5:prefetch=3 -march=bdver1
-mcmodel=medium -OPT:unroll_times_max=8:unroll_size=128:IEEE_NaN_Inf=off:ro=3
-IPA:linear=on:plimit=50000:callee_limit=5000 -INLINE:aggressive=on -CG:load_exe=0
312.swim_m : srcalt=ompl.32
FOPTIMIZE = -mp -O3 -OPT:Ofast -m3dnow -mcmodel=medium -LNO:simd=0
OMP_NUM_THREADS=16
O64_OMP_AFFINITY_MAP=1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31
314.mgrid_m : basepeak=yes
316.applu_m : srcalt=ompl
FOPTIMIZE =-Ofast -march=bdver1 -OPT:early_mp=on -mcmodel=medium
318.galgel_m : OMP_NUM_THREADS=16
O64_OMP_AFFINITY_MAP=1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31
COPTIMIZE = -Ofast -march=bdver1 -mcmodel=medium
FOPTIMIZE = -Ofast -march=bdver1 -OPT:early_mp=on -mcmodel=medium
320.equake_m : srcalt=ompl
COPTIMIZE = -Ofast -mp -OPT:alias=disjoint -march=bdver1 -mcmodel=medium
324.apsi_m : basepeak=yes
326.gafort_m : FOPTIMIZE = -Ofast -march=barcelona -OPT:early_mp=on -mcmodel=medium
328.fma3d_m : basepeak=yes
330.art_m : basepeak=yes
332.ammp_m : basepeak=yes
Alternate Sources :
314.mgrid_m (base,peak): intel
312.swim_m (peak): ompl.32
316.applu_m (peak): ompl
320.equake_m (peak): ompl
Additional notes for disk subsystem:
The disk subsystem is mounted through NFS over IPoIB via 4x DDR
InfiniBand network. There is no local disk on the system.
|