AMD Turbo CORE technology up to 3.30 GHz
Base and Peak User Environment (bash):
export OMP_NUM_THREADS=32
export OMP_DYNAMIC=FALSE
export O64_OMP_SPIN_COUNT=1000000000
export O64_OMP_SPIN_USER_LOCK=TRUE
export OMP_SLAVE_STACK=22M
export O64_OMP_SET_AFFINITY=TRUE
export O64_OMP_AFFINITY_MAP=0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31
ulimit -s unlimited
For Compiler/OpenMP flags description please refer:
SGI-x86-open64-424-flags.html
Base tuning:
C : opencc -mp -march=bdver1 -Ofast -mcmodel=medium
Fortran : openf90 -mp -march=bdver1 -Ofast -OPT:early_mp=on -mcmodel=medium
ONESTEP=yes
Peak tuning:
ONESTEP=yes for all benchmarks.
311.wupwise_l: basepeak=yes
313.swim_l : FOPTIMIZE = -mp -O3 -OPT:Ofast -m3dnow -mcmodel=medium -LNO:simd=0:prefetch_ahead=5:prefetch=3
OMP_NUM_THREADS=16
O64_OMP_AFFINITY_MAP=1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31
315.mgrid_l : FOPTIMIZE = -mp -Ofast -march=barcelona -OPT:early_mp=on -mcmodel=medium
317.applu_l : FOPTIMIZE = -mp -Ofast -march=barcelona -OPT:early_mp=on -mcmodel=medium
321.equake_l : basepeak=yes
325.apsi_l : basepeak=yes
327.gafort_l : FOPTIMIZE = -mp -Ofast -march=barcelona -OPT:early_mp=on -mcmodel=medium
329.fma3d_l : basepeak=yes
331.art_l : basepeak=yes
Additional notes for disk subsystem:
The disk subsystem is mounted through NFS over IPoIB via 4x DDR
InfiniBand network. There is no local disk on the system.
|