CPU2006 license: | 3 | Test date: | Aug-2016 |
---|---|---|---|
Test sponsor: | HPE | Hardware Availability: | Jun-2016 |
Tested by: | HPE | Software Availability: | Dec-2015 |
Hardware | |
---|---|
CPU Name: | Intel Xeon E7-8890 v4 |
CPU Characteristics: | Intel Turbo Boost Technology up to 3.40 GHz |
CPU MHz: | 2200 |
FPU: | Integrated |
CPU(s) enabled: | 384 cores, 16 chips, 24 cores/chip, 2 threads/core |
CPU(s) orderable: | 2 to 16 chips |
Primary Cache: | 32 KB I + 32 KB D on chip per core |
Secondary Cache: | 256 KB I+D on chip per core |
L3 Cache: | 60 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 4 TB (128 x 32 GB 2Rx4 PC4-2400T-L, running at 1600 MHz) |
Disk Subsystem: | 8 x C8S59A, 900 GB 10 K RPM SAS |
Other Hardware: | None |
Software | |
---|---|
Operating System: | SUSE Linux Enterprise Server 12 (x86_64) SP1 Kernel 3.12.53-60.30-default |
Compiler: | C/C++: Version 16.0.0.101 of Intel C++ Studio XE for Linux; Fortran: Version 16.0.0.101 of Intel Fortran Studio XE for Linux |
Auto Parallel: | No |
File System: | tmpfs |
System State: | Run level 5 (multi-user, w/GUI) |
Base Pointers: | 32/64-bit |
Peak Pointers: | 32/64-bit |
Other Software: | None |
Benchmark | Base | Peak | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
410.bwaves | 768 | 1834 | 5690 | 1844 | 5660 | 1834 | 5690 | 768 | 1834 | 5690 | 1844 | 5660 | 1834 | 5690 |
416.gamess | 768 | 1132 | 13300 | 1150 | 13100 | 1132 | 13300 | 768 | 1089 | 13800 | 1092 | 13800 | 1096 | 13700 |
433.milc | 768 | 1349 | 5230 | 1350 | 5220 | 1349 | 5220 | 768 | 1349 | 5230 | 1350 | 5220 | 1349 | 5220 |
434.zeusmp | 768 | 760 | 9200 | 754 | 9270 | 752 | 9290 | 768 | 760 | 9200 | 754 | 9270 | 752 | 9290 |
435.gromacs | 768 | 354 | 15500 | 356 | 15400 | 356 | 15400 | 768 | 343 | 16000 | 341 | 16100 | 339 | 16200 |
436.cactusADM | 768 | 913 | 10100 | 903 | 10200 | 907 | 10100 | 768 | 913 | 10100 | 903 | 10200 | 907 | 10100 |
437.leslie3d | 768 | 1835 | 3930 | 1840 | 3920 | 1845 | 3910 | 768 | 1835 | 3930 | 1840 | 3920 | 1845 | 3910 |
444.namd | 768 | 554 | 11100 | 556 | 11100 | 556 | 11100 | 768 | 552 | 11200 | 552 | 11200 | 553 | 11100 |
447.dealII | 768 | 480 | 18300 | 489 | 18000 | 476 | 18400 | 768 | 480 | 18300 | 489 | 18000 | 476 | 18400 |
450.soplex | 768 | 1510 | 4240 | 1511 | 4240 | 1518 | 4220 | 384 | 689 | 4650 | 689 | 4650 | 687 | 4660 |
453.povray | 768 | 240 | 17000 | 240 | 17000 | 242 | 16900 | 768 | 204 | 20000 | 203 | 20100 | 202 | 20200 |
454.calculix | 768 | 350 | 18100 | 351 | 18100 | 350 | 18100 | 768 | 350 | 18100 | 351 | 18100 | 350 | 18100 |
459.GemsFDTD | 768 | 2166 | 3760 | 2161 | 3770 | 2171 | 3750 | 768 | 2166 | 3760 | 2161 | 3770 | 2171 | 3750 |
465.tonto | 768 | 709 | 10700 | 725 | 10400 | 711 | 10600 | 768 | 671 | 11300 | 662 | 11400 | 669 | 11300 |
470.lbm | 768 | 1391 | 7580 | 1392 | 7580 | 1379 | 7650 | 768 | 1391 | 7580 | 1392 | 7580 | 1379 | 7650 |
481.wrf | 768 | 1254 | 6840 | 1253 | 6850 | 1254 | 6840 | 768 | 1254 | 6840 | 1253 | 6850 | 1254 | 6840 |
482.sphinx3 | 768 | 2037 | 7350 | 2036 | 7350 | 2038 | 7340 | 768 | 2037 | 7350 | 2036 | 7350 | 2038 | 7340 |
The numactl mechanism was used to bind copies to processors. The config file option 'submit' was used to generate numactl commands to bind each copy to a specific processor. For details, please see the config file.
Stack size set to unlimited using "ulimit -s unlimited" Power profile set with: cpupower -c all frequency-set -g performance To control C-States, /dev/cpu_dma_latency pmqos interface is used: /usr/lib/tuned/pmqos-static.py cpu_dma_latency=199 Benchmark installed under /dev/shm/cpu2006 and mounted with: mount -o bind /dev/shm/cpu2006 /cpu2006 Transparent Huge Pages enabled with: echo always > /sys/kernel/mm/transparent_hugepage/enabled Filesystem page cache cleared with: echo 1 > /proc/sys/vm/drop_caches runspec command invoked through numactl i.e.: numactl --interleave=all runspec <etc> Setting the value of perf-bias: cpupower set -b 0 Tuned profile set with: tuned-adm profile throughput-performance
Firmware settings: Memory RAS Configuration set to Maximum Performance Sysinfo program /dev/shm/cpu2006/config/sysinfo.rev6914 $Rev: 6914 $ $Date:: 2014-06-25 #$ e3fbb8667b5a285932ceab81e28219e1 running on hawk049os1 Wed Aug 3 23:33:09 2016 This section contains SUT (System Under Test) info as seen by some common utilities. To remove or add to this section, see: http://www.spec.org/cpu2006/Docs/config.html#sysinfo From /proc/cpuinfo model name : Intel(R) Xeon(R) CPU E7-8890 v4 @ 2.20GHz 16 "physical id"s (chips) 768 "processors" cores, siblings (Caution: counting these is hw and system dependent. The following excerpts from /proc/cpuinfo might not be reliable. Use with caution.) cpu cores : 24 siblings : 48 physical 0: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 1: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 2: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 3: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 4: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 5: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 6: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 7: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 8: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 9: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 10: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 11: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 12: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 13: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 14: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 physical 15: cores 0 1 2 3 4 5 8 9 10 11 12 13 16 17 18 19 20 21 24 25 26 27 28 29 cache size : 61440 KB From /proc/meminfo MemTotal: 4235891396 kB HugePages_Total: 0 Hugepagesize: 2048 kB /usr/bin/lsb_release -d SUSE Linux Enterprise Server 12 SP1 From /etc/*release* /etc/*version* SuSE-release: SUSE Linux Enterprise Server 12 (x86_64) VERSION = 12 PATCHLEVEL = 1 # This file is deprecated and will be removed in a future service pack or release. # Please check /etc/os-release for details about this release. os-release: NAME="SLES" VERSION="12-SP1" VERSION_ID="12.1" PRETTY_NAME="SUSE Linux Enterprise Server 12 SP1" ID="sles" ANSI_COLOR="0;32" CPE_NAME="cpe:/o:suse:sles:12:sp1" uname -a: Linux hawk049os1 3.12.53-60.30-default #1 SMP Wed Feb 10 14:41:46 UTC 2016 (e57129f) x86_64 x86_64 x86_64 GNU/Linux run-level 5 Aug 3 23:12 SPEC is set to: /dev/shm/cpu2006 Filesystem Type Size Used Avail Use% Mounted on tmpfs tmpfs 2.0T 11G 2.0T 1% /dev/shm Additional information from dmidecode: Warning: Use caution when you interpret this section. The 'dmidecode' program reads system data which is "intended to allow hardware to be accurately determined", but the intent may not be met, as there are frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard. BIOS HP Bundle: 008.002.106 SFW: 041.139.000 06/10/2016 Memory: 128x HP HMA84GL7MFR4N-UH 32 GB 2 rank 1067 MHz, configured at 1600 MHz 256x not defined not defined (End of data from sysinfo program) Regarding the sysinfo display about the memory installed, the correct amount of memory is 4 TB and the dmidecode description should have one line reading as: 128x HP HMA84GL7MFR4N-UH 32 GB 2 rank 1067 MHz, configured at 1600 MHz
Environment variables set by runspec before the start of the run: LD_LIBRARY_PATH = "/dev/shm/cpu2006/libs/32:/dev/shm/cpu2006/libs/64:/dev/shm/cpu2006/sh" Binaries compiled on a system with 1x Intel Core i5-4670K CPU + 32GB memory using RedHat EL 7.1
icc -m64 |
icpc -m64 |
ifort -m64 |
icc -m64 ifort -m64 |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
-xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch -auto-p32 -ansi-alias -opt-mem-layout-trans=3 |
-xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch -auto-p32 -ansi-alias -opt-mem-layout-trans=3 |
-xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch |
-xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch -auto-p32 -ansi-alias -opt-mem-layout-trans=3 |
icc -m64 |
icpc -m64 | |
450.soplex: | icpc -m32 -L/opt/intel/compilers_and_libraries_2016/linux/compiler/lib/ia32_lin |
ifort -m64 |
icc -m64 ifort -m64 |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -D_FILE_OFFSET_BITS=64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
433.milc: | basepeak = yes |
470.lbm: | basepeak = yes |
482.sphinx3: | basepeak = yes |
444.namd: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -fno-alias -auto-ilp32 |
447.dealII: | basepeak = yes |
450.soplex: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -opt-malloc-options=3 |
453.povray: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -unroll4 -ansi-alias |
410.bwaves: | basepeak = yes |
416.gamess: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -prof-use(pass 2) -unroll2 -inline-level=0 -scalar-rep- |
434.zeusmp: | basepeak = yes |
437.leslie3d: | basepeak = yes |
459.GemsFDTD: | basepeak = yes |
465.tonto: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -prof-use(pass 2) -unroll4 -auto -inline-calloc -opt-malloc-options=3 |
435.gromacs: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -opt-prefetch -auto-ilp32 |
436.cactusADM: | basepeak = yes |
454.calculix: | basepeak = yes |
481.wrf: | basepeak = yes |