SPEC® CPU2017 Floating Point Speed Result

Copyright 2017-2018 Standard Performance Evaluation Corporation

Hewlett Packard Enterprise (Test Sponsor: HPE)

ProLiant DL560 Gen10
(2.00 GHz, Intel Xeon Platinum 8164)

SPECspeed2017_fp_base = 16900

SPECspeed2017_fp_peak = Not Run

CPU2017 License: 3 Test Date: Oct-2017
Test Sponsor: HPE Hardware Availability: Oct-2017
Tested by: HPE Software Availability: Sep-2017

Benchmark result graphs are available in the PDF report.

Hardware
CPU Name: Intel Xeon Platinum 8164
  Max MHz.: 3700
  Nominal: 2000
Enabled: 104 cores, 4 chips
Orderable: 1, 2, 4 chip(s)
Cache L1: 32 KB I + 32 KB D on chip per core
  L2: 1 MB I+D on chip per core
  L3: 35.75 MB I+D on chip per chip
  Other: None
Memory: 768 GB (48 x 16 GB 2Rx8 PC4-2666V-R)
Storage: 1 x 480 GB SATA SSD, RAID 0
Other: None
Software
OS: Red Hat Enterprise Linux Server release 7.3
(Maipo)
Kernel 3.10.0-514.el7.x86_64
Compiler: C/C++: Version 18.0.0.128 of Intel C/C++
Compiler for Linux;
Fortran: Version 18.0.0.128 of Intel Fortran
Compiler for Linux
Parallel: Yes
Firmware: HPE BIOS Version U34 released Oct-2017 (tested with U34 9/29/2017)
File System: xfs
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: Not Applicable
Other: None

Results Table

Benchmark Base Peak
Threads Seconds Ratio Seconds Ratio Seconds Ratio Threads Seconds Ratio Seconds Ratio Seconds Ratio
SPECspeed2017_fp_base 16900
SPECspeed2017_fp_peak Not Run
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
603.bwaves_s 104 75.7 7800 77.4 7620 75.7 7800
607.cactuBSSN_s 104 72.2 2310 72.5 2300 71.7 2320
619.lbm_s 104 65.2 80.3 64.9 80.7 65.3 80.3
621.wrf_s 104 1750 75.8 1760 75.3 1760 75.1
627.cam4_s 104 58.6 1510 58.2 1520 58.2 1520
628.pop2_s 104 2090 56.9 2070 57.3 2160 55.0
638.imagick_s 104 67.5 2140 68.1 2120 66.9 2160
644.nab_s 104 48.1 3630 48.1 3630 47.9 3650
649.fotonik3d_s 104 83.6 1090 83.7 1090 85.8 1060
654.roms_s 104 66.8 2360 65.4 2410 68.3 2310

Operating System Notes

 Stack size set to unlimited using "ulimit -s unlimited"
 Transparent Huge Pages enabled by default
 Prior to runcpu invocation
 Filesystem page cache synced and cleared with:
 sync; echo 3>       /proc/sys/vm/drop_caches
 IRQ balance service was stop using "service irqbalance stop"
 Tuned-adm profile was set to Throughtput-Performance

General Notes

Environment variables set by runcpu before the start of the run:
KMP_AFFINITY = "granularity=core,compact"
LD_LIBRARY_PATH = "/cpu2017/lib/ia32:/cpu2017/lib/intel64:/cpu2017/je5.0.1-32:/cpu2017/je5.0.1-64"
OMP_STACKSIZE = "192M"

 Binaries compiled on a system with 1x Intel Core i7-4790 CPU + 32GB RAM
 memory using Redhat Enterprise Linux 7.4

Platform Notes

BIOS Configuration:
 Intel Hyperthreading set to Disabled
 Thermal Configuration set to Maximum Cooling
 LLC Prefetch set to Enabled
 LLC Dead Line Allocation set to Disabled
 Stale A to S set to Enabled
 Memory Patrol Scrubbing set to Disabled
 Workload Profile set to General Peak Frequency Compute
  Energy/Performance Bias set to Maximum Performance
 Workload Profile set to Custom
  NUMA Group Size Optimization set to Flat
 Sysinfo program /cpu2017/bin/sysinfo
 Rev: r5797 of 2017-06-14 96c45e4568ad54c135fd618bcc091c0f
 running on DL560-Gen10 Tue Oct 31 13:25:21 2017

 SUT (System Under Test) info as seen by some common utilities.
 For more information on this section, see
    https://www.spec.org/cpu2017/Docs/config.html#sysinfo

 From /proc/cpuinfo
    model name : Intel(R) Xeon(R) Platinum 8164 CPU @ 2.00GHz
       4  "physical id"s (chips)
       104 "processors"
    cores, siblings (Caution: counting these is hw and system dependent. The following
    excerpts from /proc/cpuinfo might not be reliable.  Use with caution.)
       cpu cores : 26
       siblings  : 26
       physical 0: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 16 17 18 19 20 21 22 24 25 26 27 28
       29
       physical 1: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 16 17 18 19 20 21 22 24 25 26 27 28
       29
       physical 2: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 16 17 18 19 20 21 22 24 25 26 27 28
       29
       physical 3: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 16 17 18 19 20 21 22 24 25 26 27 28
       29

 From lscpu:
      Architecture:          x86_64
      CPU op-mode(s):        32-bit, 64-bit
      Byte Order:            Little Endian
      CPU(s):                104
      On-line CPU(s) list:   0-103
      Thread(s) per core:    1
      Core(s) per socket:    26
      Socket(s):             4
      NUMA node(s):          4
      Vendor ID:             GenuineIntel
      CPU family:            6
      Model:                 85
      Model name:            Intel(R) Xeon(R) Platinum 8164 CPU @ 2.00GHz
      Stepping:              4
      CPU MHz:               2000.000
      BogoMIPS:              4004.79
      Virtualization:        VT-x
      L1d cache:             32K
      L1i cache:             32K
      L2 cache:              1024K
      L3 cache:              36608K
      NUMA node0 CPU(s):     0-25
      NUMA node1 CPU(s):     26-51
      NUMA node2 CPU(s):     52-77
      NUMA node3 CPU(s):     78-103

 /proc/cpuinfo cache data
    cache size : 36608 KB

 From numactl --hardware  WARNING: a numactl 'node' might or might not correspond to a
 physical chip.
   available: 4 nodes (0-3)
   node 0 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25
   node 0 size: 196266 MB
   node 0 free: 191136 MB
   node 1 cpus: 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50
   51
   node 1 size: 196608 MB
   node 1 free: 191542 MB
   node 2 cpus: 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76
   77
   node 2 size: 196608 MB
   node 2 free: 191380 MB
   node 3 cpus: 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101
   102 103
   node 3 size: 196607 MB
   node 3 free: 190310 MB
   node distances:
   node   0   1   2   3
     0:  10  21  21  21
     1:  21  10  21  21
     2:  21  21  10  21
     3:  21  21  21  10

 From /proc/meminfo
    MemTotal:       792067508 kB
    HugePages_Total:       0
    Hugepagesize:       2048 kB

 From /etc/*release* /etc/*version*
    os-release:
       NAME="Red Hat Enterprise Linux Server"
       VERSION="7.3 (Maipo)"
       ID="rhel"
       ID_LIKE="fedora"
       VERSION_ID="7.3"
       PRETTY_NAME="Red Hat Enterprise Linux Server 7.3 (Maipo)"
       ANSI_COLOR="0;31"
       CPE_NAME="cpe:/o:redhat:enterprise_linux:7.3:GA:server"
    redhat-release: Red Hat Enterprise Linux Server release 7.3 (Maipo)
    system-release: Red Hat Enterprise Linux Server release 7.3 (Maipo)
    system-release-cpe: cpe:/o:redhat:enterprise_linux:7.3:ga:server

 uname -a:
    Linux DL560-Gen10 3.10.0-514.el7.x86_64 #1 SMP Wed Oct 19 11:24:13 EDT 2016 x86_64
    x86_64 x86_64 GNU/Linux

 run-level 3 Oct 31 10:46

 SPEC is set to: /cpu2017
    Filesystem     Type  Size  Used Avail Use% Mounted on
    /dev/sdb1      xfs   447G   32G  416G   8% /

 Additional information from dmidecode follows.  WARNING: Use caution when you interpret
 this section. The 'dmidecode' program reads system data which is "intended to allow
 hardware to be accurately determined", but the intent may not be met, as there are
 frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard.
   BIOS HPE U34 09/29/2017
   Memory:
    48x UNKNOWN NOT AVAILABLE 16 GB 2 rank 2666

 (End of data from sysinfo program)

Base Compiler Invocation

C benchmarks:

 icc 

Fortran benchmarks:

 ifort 

Benchmarks using both Fortran and C:

 ifort   icc 

Benchmarks using Fortran, C, and C++:

 icpc   icc   ifort 

Base Portability Flags

603.bwaves_s:  -DSPEC_LP64 
607.cactuBSSN_s:  -DSPEC_LP64 
619.lbm_s:  -DSPEC_LP64 
621.wrf_s:  -DSPEC_LP64   -DSPEC_CASE_FLAG   -convert big_endian 
627.cam4_s:  -DSPEC_LP64   -DSPEC_CASE_FLAG 
628.pop2_s:  -DSPEC_LP64   -DSPEC_CASE_FLAG   -convert big_endian   -assume byterecl 
638.imagick_s:  -DSPEC_LP64 
644.nab_s:  -DSPEC_LP64 
649.fotonik3d_s:  -DSPEC_LP64 
654.roms_s:  -DSPEC_LP64 

Base Optimization Flags

C benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -qopenmp   -DSPEC_OPENMP 

Fortran benchmarks:

 -DSPEC_OPENMP   -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -qopenmp   -nostandard-realloc-lhs   -align array32byte 

Benchmarks using both Fortran and C:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -qopenmp   -DSPEC_OPENMP   -nostandard-realloc-lhs   -align array32byte 

Benchmarks using Fortran, C, and C++:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -qopenmp   -DSPEC_OPENMP   -nostandard-realloc-lhs   -align array32byte 

Base Other Flags

C benchmarks:

 -m64   -std=c11 

Fortran benchmarks:

 -m64 

Benchmarks using both Fortran and C:

 -m64   -std=c11 

Benchmarks using Fortran, C, and C++:

 -m64   -std=c11 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2017/flags/Intel-ic18.0-official-linux64.2017-10-19.html,
http://www.spec.org/cpu2017/flags/HPE-Platform-Flags-Intel-V1.2-SKX-revG.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2017/flags/Intel-ic18.0-official-linux64.2017-10-19.xml,
http://www.spec.org/cpu2017/flags/HPE-Platform-Flags-Intel-V1.2-SKX-revG.xml.