SPEC CPU®2017 Integer Speed Result

Copyright 2017-2022 Standard Performance Evaluation Corporation

Lenovo Global Technology

ThinkSystem SR665 V3
2.40 GHz, AMD EPYC 9654

SPECspeed®2017_int_base = 13.90

SPECspeed®2017_int_peak = 14.00

CPU2017 License: 9017 Test Date: Oct-2022
Test Sponsor: Lenovo Global Technology Hardware Availability: Dec-2022
Tested by: Lenovo Global Technology Software Availability: Nov-2022

Benchmark result graphs are available in the PDF report.

Hardware
CPU Name: AMD EPYC 9654
  Max MHz: 3700
  Nominal: 2400
Enabled: 192 cores, 2 chips
Orderable: 1,2 chips
Cache L1: 32 KB I + 32 KB D on chip per core
  L2: 1 MB I+D on chip per core
  L3: 384 MB I+D on chip per chip,
32 MB shared / 8 cores
  Other: None
Memory: 768 GB (24 x 32 GB 2Rx8 PC5-4800B-R)
Storage: 1 x 480 GB SATA SSD
Other: None
Software
OS: Red Hat Enterprise Linux 8.6 (Ootpa)
Kernel 4.18.0-372.9.1.el8.x86_64
Compiler: C/C++/Fortran: Version 4.0.0 of AOCC
Parallel: Yes
Firmware: Lenovo BIOS Version KAE103A 1.10 released Sep-2022
File System: xfs
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 64-bit
Other: None
Power Management: BIOS and OS set to prefer performance at the cost
of additional power usage

Results Table

Benchmark Base Peak
Threads Seconds Ratio Seconds Ratio Seconds Ratio Threads Seconds Ratio Seconds Ratio Seconds Ratio
SPECspeed®2017_int_base 13.90
SPECspeed®2017_int_peak 14.00
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
600.perlbench_s 192 2140 8.28 2160 8.21 2130 8.32 1 2150 8.24 2140 8.30 2140 8.31
602.gcc_s 192 2820 14.10 2820 14.10 2810 14.20 192 2820 14.10 2820 14.10 2810 14.20
605.mcf_s 192 2350 20.10 2350 20.10 2350 20.10 1 2260 20.90 2270 20.80 2270 20.80
620.omnetpp_s 192 1580 10.30 1590 10.20 1590 10.30 192 1580 10.30 1590 10.20 1590 10.30
623.xalancbmk_s 192 75.3 18.80 75.1 18.90 75.6 18.70 1 69.5 20.40 69.6 20.30 69.5 20.40
625.x264_s 192 83.3 21.20 83.2 21.20 83.3 21.20 192 83.3 21.20 83.2 21.20 83.3 21.20
631.deepsjeng_s 192 2050 7.00 2060 6.94 2050 7.00 192 2050 7.00 2060 6.94 2050 7.00
641.leela_s 192 2940 5.80 2940 5.81 2930 5.81 192 2940 5.80 2940 5.81 2930 5.81
648.exchange2_s 192 1140 25.90 1140 25.90 1130 25.90 192 1140 25.90 1140 25.90 1130 25.90
657.xz_s 192 2360 26.10 2360 26.20 2350 26.30 192 2360 26.20 2350 26.30 2360 26.20

Compiler Notes

The AMD64 AOCC Compiler Suite is available at
http://developer.amd.com/amd-aocc/

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size limit
'ulimit -l 2097152' was used to set environment locked pages in memory limit

runcpu command invoked through numactl i.e.:
numactl --interleave=all runcpu <etc>

To limit dirty cache to 8% of memory, 'sysctl -w vm.dirty_ratio=8' run as root.
To limit swap usage to minimum necessary, 'sysctl -w vm.swappiness=1' run as root.
To free node-local memory and avoid remote memory usage,
'sysctl -w vm.zone_reclaim_mode=1' run as root.
To clear filesystem caches, 'sync; sysctl -w vm.drop_caches=3' run as root.
To disable address space layout randomization (ASLR) to reduce run-to-run
variability, 'sysctl -w kernel.randomize_va_space=0' run as root.

To enable Transparent Hugepages (THP) for all allocations,
'echo always > /sys/kernel/mm/transparent_hugepage/enabled' and
'echo always > /sys/kernel/mm/transparent_hugepage/defrag' run as root.

Environment Variables Notes

Environment variables set by runcpu before the start of the run:
GOMP_CPU_AFFINITY = "0-191"
LD_LIBRARY_PATH =
     "/home/cpu2017-1.1.8-amd-aocc400-genoa-B1b/amd_speed_aocc400_genoa_B_lib
     /lib:"
LIBOMP_NUM_HIDDEN_HELPER_THREADS = "0"
MALLOC_CONF = "oversize_threshold:0,retain:true"
OMP_DYNAMIC = "false"
OMP_SCHEDULE = "static"
OMP_STACKSIZE = "128M"
OMP_THREAD_LIMIT = "192"

Environment variables set by runcpu during the 600.perlbench_s peak run:
GOMP_CPU_AFFINITY = "15"

Environment variables set by runcpu during the 605.mcf_s peak run:
GOMP_CPU_AFFINITY = "15"

Environment variables set by runcpu during the 623.xalancbmk_s peak run:
GOMP_CPU_AFFINITY = "15"

Environment variables set by runcpu during the 657.xz_s peak run:
GOMP_CPU_AFFINITY = "0-191"
LIBOMP_NUM_HIDDEN_HELPER_THREADS = "8"

General Notes

Binaries were compiled on a system with 2x AMD EPYC 9174F CPU + 1.5TiB Memory using RHEL 8.6

NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown)
is mitigated in the system as tested and documented.
Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1)
is mitigated in the system as tested and documented.
Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2)
is mitigated in the system as tested and documented.

Platform Notes

BIOS configuration:
Operating Mode set to Maximum Performance and then set it to Custom Mode
NUMA Nodes per Socket set to NPS4
SMT Mode set to Disabled
L1 Region Prefetcher set to Disabled

 Sysinfo program /home/cpu2017-1.1.8-amd-aocc400-genoa-B1b/bin/sysinfo
 Rev: r6622 of 2021-04-07 982a61ec0915b55891ef0e16acafc64d
 running on localhost.localdomain Sun Oct 23 08:56:49 2022

 SUT (System Under Test) info as seen by some common utilities.
 For more information on this section, see
    https://www.spec.org/cpu2017/Docs/config.html#sysinfo

 From /proc/cpuinfo
    model name : AMD EPYC 9654 96-Core Processor
       2  "physical id"s (chips)
       192 "processors"
    cores, siblings (Caution: counting these is hw and system dependent. The following
    excerpts from /proc/cpuinfo might not be reliable.  Use with caution.)
       cpu cores : 96
       siblings  : 96
       physical 0: cores 0 1 2 3 4 5 6 7 16 17 18 19 20 21 22 23 36 37 38 39 52 53 54 55
       64 65 66 67 68 69 70 71 80 81 82 83 84 85 86 87 96 97 98 99 100 101 102 103 112 113
       114 115 116 117 118 119 128 129 130 131 132 133 134 135 144 145 146 147 148 149 150
       151 160 161 162 163 164 165 166 167 176 177 178 179 180 181 182 183
       physical 1: cores 0 1 2 3 4 5 6 7 16 17 18 19 20 21 22 23 36 37 38 39 52 53 54 55
       64 65 66 67 68 69 70 71 80 81 82 83 84 85 86 87 96 97 98 99 100 101 102 103 112 113
       114 115 116 117 118 119 128 129 130 131 132 133 134 135 144 145 146 147 148 149 150
       151 160 161 162 163 164 165 166 167 176 177 178 179 180 181 182 183

 From lscpu from util-linux 2.32.1:
      Architecture:        x86_64
      CPU op-mode(s):      32-bit, 64-bit
      Byte Order:          Little Endian
      CPU(s):              192
      On-line CPU(s) list: 0-191
      Thread(s) per core:  1
      Core(s) per socket:  96
      Socket(s):           2
      NUMA node(s):        8
      Vendor ID:           AuthenticAMD
      BIOS Vendor ID:      Advanced Micro Devices, Inc.
      CPU family:          25
      Model:               17
      Model name:          AMD EPYC 9654 96-Core Processor
      BIOS Model name:     AMD EPYC 9654 96-Core Processor
      Stepping:            1
      CPU MHz:             2400.000
      CPU max MHz:         3707.8120
      CPU min MHz:         1500.0000
      BogoMIPS:            4792.38
      Virtualization:      AMD-V
      L1d cache:           32K
      L1i cache:           32K
      L2 cache:            1024K
      L3 cache:            32768K
      NUMA node0 CPU(s):   0-23
      NUMA node1 CPU(s):   24-47
      NUMA node2 CPU(s):   48-71
      NUMA node3 CPU(s):   72-95
      NUMA node4 CPU(s):   96-119
      NUMA node5 CPU(s):   120-143
      NUMA node6 CPU(s):   144-167
      NUMA node7 CPU(s):   168-191
      Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
      pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm
      constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq
      monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c
      rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch
      osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb
      cat_l3 cdp_l3 invpcid_single hw_pstate ssbd mba ibrs ibpb stibp vmmcall fsgsbase
      bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a avx512f avx512dq rdseed adx smap
      avx512ifma clflushopt clwb avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec xgetbv1
      xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local avx512_bf16 clzero irperf
      xsaveerptr wbnoinvd amd_ppin arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean
      flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif
      v_spec_ctrl avx512vbmi umip pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni
      avx512_bitalg avx512_vpopcntdq la57 rdpid overflow_recov succor smca fsrm flush_l1d

 /proc/cpuinfo cache data
    cache size : 1024 KB

 From numactl --hardware
 WARNING: a numactl 'node' might or might not correspond to a physical chip.
   available: 8 nodes (0-7)
   node 0 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
   node 0 size: 96377 MB
   node 0 free: 95711 MB
   node 1 cpus: 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47
   node 1 size: 96754 MB
   node 1 free: 96422 MB
   node 2 cpus: 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71
   node 2 size: 96754 MB
   node 2 free: 96255 MB
   node 3 cpus: 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95
   node 3 size: 96754 MB
   node 3 free: 96434 MB
   node 4 cpus: 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114
   115 116 117 118 119
   node 4 size: 96712 MB
   node 4 free: 96407 MB
   node 5 cpus: 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137
   138 139 140 141 142 143
   node 5 size: 96685 MB
   node 5 free: 96377 MB
   node 6 cpus: 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161
   162 163 164 165 166 167
   node 6 size: 96754 MB
   node 6 free: 96448 MB
   node 7 cpus: 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185
   186 187 188 189 190 191
   node 7 size: 96754 MB
   node 7 free: 96448 MB
   node distances:
   node   0   1   2   3   4   5   6   7
     0:  10  12  12  12  32  32  32  32
     1:  12  10  12  12  32  32  32  32
     2:  12  12  10  12  32  32  32  32
     3:  12  12  12  10  32  32  32  32
     4:  32  32  32  32  10  12  12  12
     5:  32  32  32  32  12  10  12  12
     6:  32  32  32  32  12  12  10  12
     7:  32  32  32  32  12  12  12  10

 From /proc/meminfo
    MemTotal:       792111308 kB
    HugePages_Total:       0
    Hugepagesize:       2048 kB

 /sbin/tuned-adm active
     Current active profile: throughput-performance

 /sys/devices/system/cpu/cpu*/cpufreq/scaling_governor has
    performance

 From /etc/*release* /etc/*version*
    os-release:
       NAME="Red Hat Enterprise Linux"
       VERSION="8.6 (Ootpa)"
       ID="rhel"
       ID_LIKE="fedora"
       VERSION_ID="8.6"
       PLATFORM_ID="platform:el8"
       PRETTY_NAME="Red Hat Enterprise Linux 8.6 (Ootpa)"
       ANSI_COLOR="0;31"
    redhat-release: Red Hat Enterprise Linux release 8.6 (Ootpa)
    system-release: Red Hat Enterprise Linux release 8.6 (Ootpa)
    system-release-cpe: cpe:/o:redhat:enterprise_linux:8::baseos

 uname -a:
    Linux localhost.localdomain 4.18.0-372.9.1.el8.x86_64 #1 SMP Fri Apr 15 22:12:19 EDT
    2022 x86_64 x86_64 x86_64 GNU/Linux

 Kernel self-reported vulnerability status:

 CVE-2018-12207 (iTLB Multihit):                        Not affected
 CVE-2018-3620 (L1 Terminal Fault):                     Not affected
 Microarchitectural Data Sampling:                      Not affected
 CVE-2017-5754 (Meltdown):                              Not affected
 CVE-2018-3639 (Speculative Store Bypass):              Mitigation: Speculative Store
                                                        Bypass disabled via prctl and
                                                        seccomp
 CVE-2017-5753 (Spectre variant 1):                     Mitigation: usercopy/swapgs
                                                        barriers and __user pointer
                                                        sanitization
 CVE-2017-5715 (Spectre variant 2):                     Mitigation: Retpolines, IBPB:
                                                        conditional, IBRS_FW, STIBP:
                                                        disabled, RSB filling
 CVE-2020-0543 (Special Register Buffer Data Sampling): Not affected
 CVE-2019-11135 (TSX Asynchronous Abort):               Not affected

 run-level 3 Oct 23 08:53

 SPEC is set to: /home/cpu2017-1.1.8-amd-aocc400-genoa-B1b
    Filesystem     Type  Size  Used Avail Use% Mounted on
    /dev/sda4      xfs   373G  114G  260G  31% /home

 From /sys/devices/virtual/dmi/id
     Vendor:         Lenovo
     Product:        ThinkSystem SR665 V3 MB,Genoa,Kauai,DDR5,Kauai,2U
     Product Family: ThinkSystem
     Serial:         1234567890

 Additional information from dmidecode 3.3 follows.  WARNING: Use caution when you
 interpret this section. The 'dmidecode' program reads system data which is "intended to
 allow hardware to be accurately determined", but the intent may not be met, as there are
 frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard.
   Memory:
     6x SK Hynix HMCG88AEBRA115N 32 GB 2 rank 4800
     18x SK Hynix HMCG88AEBRA168N 32 GB 2 rank 4800

 BIOS:
    BIOS Vendor:       Lenovo
    BIOS Version:      KAE103A-1.10
    BIOS Date:         09/26/2022
    BIOS Revision:     1.10
    Firmware Revision: 1.0

 (End of data from sysinfo program)

Compiler Version Notes

==============================================================================
C       | 600.perlbench_s(base, peak) 602.gcc_s(base, peak) 605.mcf_s(base,
        | peak) 625.x264_s(base, peak) 657.xz_s(base, peak)
------------------------------------------------------------------------------
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#389 2022_10_07) (based on
  LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-rel-4.0-3206-389/bin
------------------------------------------------------------------------------

==============================================================================
C++     | 620.omnetpp_s(base, peak) 623.xalancbmk_s(base, peak)
        | 631.deepsjeng_s(base, peak) 641.leela_s(base, peak)
------------------------------------------------------------------------------
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#389 2022_10_07) (based on
  LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-rel-4.0-3206-389/bin
------------------------------------------------------------------------------

==============================================================================
Fortran | 648.exchange2_s(base, peak)
------------------------------------------------------------------------------
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#389 2022_10_07) (based on
  LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-rel-4.0-3206-389/bin
------------------------------------------------------------------------------

Base Compiler Invocation

C benchmarks:

 clang 

C++ benchmarks:

 clang++ 

Fortran benchmarks:

 flang 

Base Portability Flags

600.perlbench_s:  -DSPEC_LINUX_X64   -DSPEC_LP64 
602.gcc_s:  -DSPEC_LP64 
605.mcf_s:  -DSPEC_LP64 
620.omnetpp_s:  -DSPEC_LP64 
623.xalancbmk_s:  -DSPEC_LINUX   -DSPEC_LP64 
625.x264_s:  -DSPEC_LP64 
631.deepsjeng_s:  -DSPEC_LP64 
641.leela_s:  -DSPEC_LP64 
648.exchange2_s:  -DSPEC_LP64 
657.xz_s:  -DSPEC_LP64 

Base Optimization Flags

C benchmarks:

 -m64   -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-allow-multiple-definition   -O3   -march=znver4   -fveclib=AMDLIBM   -ffast-math   -fopenmp   -flto   -fstruct-layout=7   -mllvm -unroll-threshold=50   -mllvm -inline-threshold=1000   -fremap-arrays   -fstrip-mining   -mllvm -reduce-array-computations=3   -DSPEC_OPENMP   -zopt   -fopenmp=libomp   -lomp   -lamdlibm   -lflang   -lamdalloc 

C++ benchmarks:

 -m64   -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6   -Wl,-mllvm -Wl,-reduce-array-computations=3   -O3   -march=znver4   -fveclib=AMDLIBM   -ffast-math   -fopenmp   -flto   -mllvm -unroll-threshold=100   -finline-aggressive   -mllvm -loop-unswitch-threshold=200000   -mllvm -reduce-array-computations=3   -DSPEC_OPENMP   -zopt   -fvirtual-function-elimination   -fvisibility=hidden   -fopenmp=libomp   -lomp   -lamdlibm   -lflang   -lamdalloc-ext 

Fortran benchmarks:

 -m64   -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-mllvm -Wl,-inline-recursion=4   -Wl,-mllvm -Wl,-lsr-in-nested-loop   -Wl,-mllvm -Wl,-enable-iv-split   -O3   -march=znver4   -fveclib=AMDLIBM   -ffast-math   -fopenmp   -flto   -mllvm -optimize-strided-mem-cost   -mllvm -unroll-aggressive   -mllvm -unroll-threshold=150   -fopenmp=libomp   -lomp   -lamdlibm   -lflang   -lamdalloc 

Base Other Flags

C benchmarks:

 -Wno-return-type   -Wno-unused-command-line-argument 

C++ benchmarks:

 -Wno-unused-command-line-argument 

Fortran benchmarks:

 -Wno-unused-command-line-argument 

Peak Compiler Invocation

C benchmarks:

 clang 

C++ benchmarks:

 clang++ 

Fortran benchmarks:

 flang 

Peak Portability Flags

Same as Base Portability Flags

Peak Optimization Flags

C benchmarks:

600.perlbench_s:  -m64   -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-allow-multiple-definition   -Ofast   -march=znver4   -fveclib=AMDLIBM   -ffast-math   -fopenmp   -flto   -fstruct-layout=9   -mllvm -unroll-threshold=50   -fremap-arrays   -fstrip-mining   -mllvm -inline-threshold=1000   -mllvm -reduce-array-computations=3   -DSPEC_OPENMP   -zopt   -fopenmp=libomp   -lomp   -lamdlibm   -lamdalloc   -lflang 
602.gcc_s:  basepeak = yes 
605.mcf_s:  Same as 600.perlbench_s 
625.x264_s:  basepeak = yes 
657.xz_s:  Same as 600.perlbench_s 

C++ benchmarks:

620.omnetpp_s:  basepeak = yes 
623.xalancbmk_s:  -m64   -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-mllvm -Wl,-do-block-reorder=aggressive   -Ofast   -march=znver4   -fveclib=AMDLIBM   -ffast-math   -fopenmp   -flto   -finline-aggressive   -mllvm -unroll-threshold=100   -mllvm -reduce-array-computations=3   -DSPEC_OPENMP   -zopt   -mllvm -do-block-reorder=aggressive   -fvirtual-function-elimination   -fvisibility=hidden   -fopenmp=libomp   -lomp   -lamdlibm   -lamdalloc-ext   -lflang 
631.deepsjeng_s:  basepeak = yes 
641.leela_s:  basepeak = yes 

Fortran benchmarks:

648.exchange2_s:  basepeak = yes 

Peak Other Flags

C benchmarks:

 -Wno-return-type   -Wno-unused-command-line-argument 

C++ benchmarks:

 -Wno-unused-command-line-argument 

Fortran benchmarks:

 -Wno-unused-command-line-argument 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2017/flags/aocc400-flags.html,
http://www.spec.org/cpu2017/flags/Lenovo-Platform-SPECcpu2017-Flags-V1.2-Genoa-N.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2017/flags/aocc400-flags.xml,
http://www.spec.org/cpu2017/flags/Lenovo-Platform-SPECcpu2017-Flags-V1.2-Genoa-N.xml.