SPEC CPU®2017 Floating Point Rate Result
Copyright 2017-2023 Standard Performance Evaluation Corporation
Benchmark result graphs are available in the PDF report.
The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.
'ulimit -s unlimited' was used to set environment stack size limit
'ulimit -l 2097152' was used to set environment locked pages in memory limit
runcpu command invoked through numactl i.e.:
numactl --interleave=all runcpu <etc>
To limit dirty cache to 8% of memory, 'sysctl -w vm.dirty_ratio=8' run as root.
To limit swap usage to minimum necessary, 'sysctl -w vm.swappiness=1' run as root.
To free node-local memory and avoid remote memory usage,
'sysctl -w vm.zone_reclaim_mode=1' run as root.
To clear filesystem caches, 'sync; sysctl -w vm.drop_caches=3' run as root.
To disable address space layout randomization (ASLR) to reduce run-to-run
variability, 'sysctl -w kernel.randomize_va_space=0' run as root.
To enable Transparent Hugepages (THP) only on request for base runs,
'echo madvise > /sys/kernel/mm/transparent_hugepage/enabled' run as root.
To enable THP for all allocations for peak runs,
'echo always > /sys/kernel/mm/transparent_hugepage/enabled' and
'echo always > /sys/kernel/mm/transparent_hugepage/defrag' run as root.
Environment variables set by runcpu before the start of the run:
LD_LIBRARY_PATH =
"/home/cpu2017/amd_rate_aocc400_znver4_A_lib/lib:/home/cpu2017/amd_rate_aocc400_znver4_A_lib/lib32:"
MALLOC_CONF = "retain:true"
Binaries were compiled on a system with 2x AMD EPYC 9174F CPU + 1.5TiB Memory using RHEL 8.6
NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown)
is mitigated in the system as tested and documented.
Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1)
is mitigated in the system as tested and documented.
Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2)
is mitigated in the system as tested and documented.
BIOS Settings:
cTDP: 360
Determinism Slider set to Power
Package Power: 360
EDC: 400
ACPI SRAT L3 Cache as NUMA Domain: enabled
Memory interleaving: Disabled
4-link xGMI max speed: 16Gbps
Fan Speed: Maximum
Sysinfo program /home/cpu2017/bin/sysinfo
Rev: r6732 of 2022-11-07 fe91c89b7ed5c36ae2c92cc097bec197
running on amd2-Super-Server Thu Sep 28 00:53:27 2023
SUT (System Under Test) info as seen by some common utilities.
------------------------------------------------------------
Table of contents
------------------------------------------------------------
1. uname -a
2. w
3. Username
4. ulimit -a
5. sysinfo process ancestry
6. /proc/cpuinfo
7. lscpu
8. numactl --hardware
9. /proc/meminfo
10. who -r
11. Systemd service manager version: systemd 245 (245.4-4ubuntu3.20)
12. Failed units, from systemctl list-units --state=failed
13. Services, from systemctl list-unit-files
14. Linux kernel boot-time arguments, from /proc/cmdline
15. sysctl
16. /sys/kernel/mm/transparent_hugepage
17. /sys/kernel/mm/transparent_hugepage/khugepaged
18. OS release
19. Disk information
20. /sys/devices/virtual/dmi/id
21. dmidecode
22. BIOS
------------------------------------------------------------
------------------------------------------------------------
1. uname -a
Linux amd2-Super-Server 5.15.0-84-generic #93~20.04.1-Ubuntu SMP Wed Sep 6 16:15:40 UTC 2023 x86_64 x86_64
x86_64 GNU/Linux
------------------------------------------------------------
2. w
00:53:27 up 8:01, 1 user, load average: 158.59, 318.06, 354.78
USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT
amd2 tty2 - 16:53 7:57m 2.23s 0.05s -bash
------------------------------------------------------------
3. Username
From environment variable $USER: root
From the command 'logname': amd2
------------------------------------------------------------
4. ulimit -a
time(seconds) unlimited
file(blocks) unlimited
data(kbytes) unlimited
stack(kbytes) unlimited
coredump(blocks) 0
memory(kbytes) unlimited
locked memory(kbytes) 2097152
process 5932677
nofiles 1024
vmemory(kbytes) unlimited
locks unlimited
rtprio 0
------------------------------------------------------------
5. sysinfo process ancestry
/sbin/init splash
/bin/login -p --
-bash
sudo su
su
bash
python3 ./run_amd_rate_aocc400_znver4_A1.py
/bin/bash ./amd_rate_aocc400_znver4_A1.sh
runcpu --config amd_rate_aocc400_znver4_A1.cfg --tune all --reportable --iterations 3 fprate
runcpu --configfile amd_rate_aocc400_znver4_A1.cfg --tune all --reportable --iterations 3 --nopower
--runmode rate --tune base:peak --size test:train:refrate fprate --nopreenv --note-preenv --logfile
$SPEC/tmp/CPU2017.002/templogs/preenv.fprate.002.0.log --lognum 002.0 --from_runcpu 2
specperl $SPEC/bin/sysinfo
$SPEC = /home/cpu2017
------------------------------------------------------------
6. /proc/cpuinfo
model name : AMD EPYC 9654 96-Core Processor
vendor_id : AuthenticAMD
cpu family : 25
model : 17
stepping : 1
microcode : 0xa10113e
bugs : sysret_ss_attrs spectre_v1 spectre_v2 spec_store_bypass
TLB size : 3584 4K pages
cpu cores : 96
siblings : 192
2 physical ids (chips)
384 processors (hardware threads)
physical id 0: core ids 0-95
physical id 1: core ids 0-95
physical id 0: apicids 0-191
physical id 1: apicids 256-447
Caution: /proc/cpuinfo data regarding chips, cores, and threads is not necessarily reliable, especially for
virtualized systems. Use the above data carefully.
------------------------------------------------------------
7. lscpu
From lscpu from util-linux 2.34:
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
Address sizes: 52 bits physical, 57 bits virtual
CPU(s): 384
On-line CPU(s) list: 0-383
Thread(s) per core: 2
Core(s) per socket: 96
Socket(s): 2
NUMA node(s): 2
Vendor ID: AuthenticAMD
CPU family: 25
Model: 17
Model name: AMD EPYC 9654 96-Core Processor
Stepping: 1
Frequency boost: enabled
CPU MHz: 1500.000
CPU max MHz: 3707.8120
CPU min MHz: 1500.0000
BogoMIPS: 4799.74
Virtualization: AMD-V
L1d cache: 6 MiB
L1i cache: 6 MiB
L2 cache: 192 MiB
L3 cache: 768 MiB
NUMA node0 CPU(s): 0-95,192-287
NUMA node1 CPU(s): 96-191,288-383
Vulnerability Gather data sampling: Not affected
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Mmio stale data: Not affected
Vulnerability Retbleed: Not affected
Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl and seccomp
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; Retpolines, IBPB conditional, IBRS_FW, STIBP always-on, RSB
filling, PBRSB-eIBRS Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36
clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp
lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf
rapl pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic
movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic
cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce
topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3
cdp_l3 invpcid_single hw_pstate ssbd mba ibrs ibpb stibp vmmcall
fsgsbase bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a avx512f avx512dq
rdseed adx smap avx512ifma clflushopt clwb avx512cd sha_ni avx512bw
avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc
cqm_mbm_total cqm_mbm_local avx512_bf16 clzero irperf xsaveerptr rdpru
wbnoinvd amd_ppin cppc arat npt lbrv svm_lock nrip_save tsc_scale
vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic
v_vmsave_vmload vgif v_spec_ctrl avx512vbmi umip pku ospke avx512_vbmi2
gfni vaes vpclmulqdq avx512_vnni avx512_bitalg avx512_vpopcntdq la57
rdpid overflow_recov succor smca fsrm flush_l1d
From lscpu --cache:
NAME ONE-SIZE ALL-SIZE WAYS TYPE LEVEL
L1d 32K 6M 8 Data 1
L1i 32K 6M 8 Instruction 1
L2 1M 192M 8 Unified 2
L3 32M 768M 16 Unified 3
------------------------------------------------------------
8. numactl --hardware
NOTE: a numactl 'node' might or might not correspond to a physical chip.
available: 2 nodes (0-1)
node 0 cpus: 0-95,192-287
node 0 size: 709209 MB
node 0 free: 705336 MB
node 1 cpus: 96-191,288-383
node 1 size: 774035 MB
node 1 free: 768577 MB
node distances:
node 0 1
0: 10 32
1: 32 10
------------------------------------------------------------
9. /proc/meminfo
MemTotal: 1518842524 kB
------------------------------------------------------------
10. who -r
run-level 5 Sep 26 16:41
------------------------------------------------------------
11. Systemd service manager version: systemd 245 (245.4-4ubuntu3.20)
Default Target Status
graphical degraded
------------------------------------------------------------
12. Failed units, from systemctl list-units --state=failed
UNIT LOAD ACTIVE SUB DESCRIPTION
* fwupd-refresh.service loaded failed failed Refresh fwupd metadata and update motd
------------------------------------------------------------
13. Services, from systemctl list-unit-files
STATE UNIT FILES
enabled ModemManager NetworkManager NetworkManager-dispatcher NetworkManager-wait-online
accounts-daemon anacron apparmor autovt@ avahi-daemon bluetooth console-setup cron cups
cups-browsed dmesg e2scrub_reap getty@ gpu-manager grub-common grub-initrd-fallback
irqbalance kerneloops keyboard-setup network-manager networkd-dispatcher ondemand openvpn
pppd-dns rsync rsyslog secureboot-db setvtrgb snapd ssh sshd switcheroo-control syslog
systemd-pstore systemd-resolved systemd-timesyncd thermald ua-reboot-cmds udisks2 ufw
unattended-upgrades whoopsie wpa_supplicant
enabled-runtime netplan-ovs-cleanup systemd-fsck-root systemd-remount-fs
disabled acpid brltty console-getty debug-shell openvpn-client@ openvpn-server@ openvpn@
rtkit-daemon serial-getty@ speech-dispatcher speech-dispatcherd
systemd-boot-check-no-failures systemd-network-generator systemd-networkd
systemd-networkd-wait-online systemd-time-wait-sync upower wpa_supplicant-nl80211@
wpa_supplicant-wired@ wpa_supplicant@
generated apport
indirect display-manager lightdm saned@ spice-vdagent spice-vdagentd uuidd
masked alsa-utils cryptdisks cryptdisks-early hwclock pulseaudio-enable-autospawn rc rcS saned
sudo x11-common
------------------------------------------------------------
14. Linux kernel boot-time arguments, from /proc/cmdline
BOOT_IMAGE=/boot/vmlinuz-5.15.0-84-generic
root=UUID=1ae71a13-cac0-48f6-b6e6-e15e5e687f57
ro
quiet
splash
vt.handoff=7
------------------------------------------------------------
15. sysctl
kernel.numa_balancing 1
kernel.randomize_va_space 0
vm.compaction_proactiveness 20
vm.dirty_background_bytes 0
vm.dirty_background_ratio 10
vm.dirty_bytes 0
vm.dirty_expire_centisecs 3000
vm.dirty_ratio 8
vm.dirty_writeback_centisecs 500
vm.dirtytime_expire_seconds 43200
vm.extfrag_threshold 500
vm.min_unmapped_ratio 1
vm.nr_hugepages 0
vm.nr_hugepages_mempolicy 0
vm.nr_overcommit_hugepages 0
vm.swappiness 1
vm.watermark_boost_factor 15000
vm.watermark_scale_factor 10
vm.zone_reclaim_mode 1
------------------------------------------------------------
16. /sys/kernel/mm/transparent_hugepage
defrag [always] defer defer+madvise madvise never
enabled [always] madvise never
hpage_pmd_size 2097152
shmem_enabled always within_size advise [never] deny force
------------------------------------------------------------
17. /sys/kernel/mm/transparent_hugepage/khugepaged
alloc_sleep_millisecs 60000
defrag 1
max_ptes_none 511
max_ptes_shared 256
max_ptes_swap 64
pages_to_scan 4096
scan_sleep_millisecs 10000
------------------------------------------------------------
18. OS release
From /etc/*-release /etc/*-version
os-release Ubuntu 20.04.4 LTS
------------------------------------------------------------
19. Disk information
SPEC is set to: /home/cpu2017
Filesystem Type Size Used Avail Use% Mounted on
/dev/nvme1n1p2 ext4 938G 19G 872G 3% /
------------------------------------------------------------
20. /sys/devices/virtual/dmi/id
Vendor: Tyrone Systems
Product: Tyrone Camarero SDA200A2N-212
Product Family: SMC H13
Serial: A509935X3906531
------------------------------------------------------------
21. dmidecode
Additional information from dmidecode 3.2 follows. WARNING: Use caution when you interpret this section.
The 'dmidecode' program reads system data which is "intended to allow hardware to be accurately
determined", but the intent may not be met, as there are frequent changes to hardware, firmware, and the
"DMTF SMBIOS" standard.
Memory:
1x NO DIMM NO DIMM
23x Samsung M321R8GA0BB0-CQKZJ 64 GB 2 rank 4800
------------------------------------------------------------
22. BIOS
(This section combines info from /sys/devices and dmidecode.)
BIOS Vendor: American Megatrends International, LLC.
BIOS Version: 1.4
BIOS Date: 04/19/2023
BIOS Revision: 5.27
============================================================================================================
C | 519.lbm_r(base, peak) 538.imagick_r(base, peak) 544.nab_r(base, peak)
------------------------------------------------------------------------------------------------------------
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
------------------------------------------------------------------------------------------------------------
============================================================================================================
C++ | 508.namd_r(base, peak) 510.parest_r(base, peak)
------------------------------------------------------------------------------------------------------------
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
------------------------------------------------------------------------------------------------------------
============================================================================================================
C++, C | 511.povray_r(base, peak) 526.blender_r(base, peak)
------------------------------------------------------------------------------------------------------------
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
------------------------------------------------------------------------------------------------------------
============================================================================================================
C++, C, Fortran | 507.cactuBSSN_r(base, peak)
------------------------------------------------------------------------------------------------------------
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
------------------------------------------------------------------------------------------------------------
============================================================================================================
Fortran | 503.bwaves_r(base, peak) 549.fotonik3d_r(base, peak) 554.roms_r(base, peak)
------------------------------------------------------------------------------------------------------------
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
------------------------------------------------------------------------------------------------------------
============================================================================================================
Fortran, C | 521.wrf_r(base, peak) 527.cam4_r(base, peak)
------------------------------------------------------------------------------------------------------------
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
AMD clang version 14.0.6 (CLANG: AOCC_4.0.0-Build#434 2022_10_28) (based on LLVM Mirror.Version.14.0.6)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc/aocc-compiler-4.0.0/bin
------------------------------------------------------------------------------------------------------------
Same as Base Portability Flags
508.namd_r: |
-m64
-flto
-Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
-Wl,-mllvm -Wl,-reduce-array-computations=3
-Wl,-mllvm -Wl,-x86-use-vzeroupper=false
-Ofast
-march=znver4
-fveclib=AMDLIBM
-ffast-math
-finline-aggressive
-mllvm -unroll-threshold=100
-mllvm -reduce-array-computations=3
-zopt
-lamdlibm
-lamdalloc
|
510.parest_r: |
-m64
-flto
-Wl,-mllvm -Wl,-suppress-fmas
-Wl,-mllvm -Wl,-x86-use-vzeroupper=false
-Ofast
-march=znver4
-fveclib=AMDLIBM
-ffast-math
-finline-aggressive
-mllvm -unroll-threshold=100
-mllvm -reduce-array-computations=3
-zopt
-lamdlibm
-lamdalloc
|
503.bwaves_r: |
-m64
-flto
-Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
-Wl,-mllvm -Wl,-reduce-array-computations=3
-Wl,-mllvm -Wl,-enable-X86-prefetching
-Ofast
-march=znver4
-fveclib=AMDLIBM
-ffast-math
-Mrecursive
-mllvm -reduce-array-computations=3
-fepilog-vectorization-of-inductions
-zopt
-lamdlibm
-lamdalloc
-lflang
|
549.fotonik3d_r: |
-m64
-flto
-Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
-Wl,-mllvm -Wl,-reduce-array-computations=3
-Wl,-mllvm -Wl,-enable-X86-prefetching
-Ofast
-march=znver4
-fveclib=AMDLIBM
-ffast-math
-Kieee
-Mrecursive
-mllvm -reduce-array-computations=3
-fepilog-vectorization-of-inductions
-fvector-transform
-fscalar-transform
-lamdlibm
-lamdalloc
-lflang
|
554.roms_r: |
Same as 503.bwaves_r
|
521.wrf_r: |
-m64
-flto
-Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
-Wl,-mllvm -Wl,-reduce-array-computations=3
-Wl,-mllvm -Wl,-enable-X86-prefetching
-Ofast
-march=znver4
-fveclib=AMDLIBM
-ffast-math
-fstruct-layout=7
-mllvm -unroll-threshold=50
-fremap-arrays
-fstrip-mining
-mllvm -inline-threshold=1000
-mllvm -reduce-array-computations=3
-zopt
-Mrecursive
-fepilog-vectorization-of-inductions
-lamdlibm
-lamdalloc
-lflang
|
527.cam4_r: |
-m64
-flto
-Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
-Wl,-mllvm -Wl,-reduce-array-computations=3
-Wl,-mllvm -Wl,-enable-X86-prefetching
-O3
-march=znver4
-fveclib=AMDLIBM
-ffast-math
-fstruct-layout=7
-mllvm -unroll-threshold=50
-mllvm -inline-threshold=1000
-fremap-arrays
-mllvm -reduce-array-computations=3
-zopt
-Kieee
-Mrecursive
-funroll-loops
-mllvm -lsr-in-nested-loop
-fepilog-vectorization-of-inductions
-lamdlibm
-lamdalloc
-lflang
|
511.povray_r: |
-m64
-flto
-Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
-Wl,-mllvm -Wl,-reduce-array-computations=3
-Wl,-mllvm -Wl,-x86-use-vzeroupper=false
-O3
-march=znver4
-fveclib=AMDLIBM
-ffast-math
-fstruct-layout=7
-mllvm -unroll-threshold=50
-mllvm -inline-threshold=1000
-fremap-arrays
-mllvm -reduce-array-computations=3
-zopt
-mllvm -unroll-threshold=100
-finline-aggressive
-mllvm -loop-unswitch-threshold=200000
-lamdlibm
-lamdalloc
|
526.blender_r: |
-m64
-flto
-Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
-Wl,-mllvm -Wl,-reduce-array-computations=3
-Wl,-mllvm -Wl,-x86-use-vzeroupper=false
-Ofast
-march=znver4
-fveclib=AMDLIBM
-ffast-math
-fstruct-layout=7
-mllvm -unroll-threshold=50
-fremap-arrays
-fstrip-mining
-mllvm -inline-threshold=1000
-mllvm -reduce-array-computations=3
-zopt
-finline-aggressive
-mllvm -unroll-threshold=100
-lamdlibm
-lamdalloc
|