SPEC CPU(R)2017 Floating Point Rate Result Hewlett Packard Enterprise ProLiant DL325 Gen10 Plus v2 (3.05 GHz, AMD EPYC 7373X) Test Sponsor: HPE CPU2017 License: 3 Test date: Feb-2022 Test sponsor: HPE Hardware availability: Mar-2022 Tested by: HPE Software availability: Jan-2022 Base Base Base Peak Peak Peak Benchmarks Copies Run Time Rate Copies Run Time Rate --------------- ------- --------- --------- ------- --------- --------- 503.bwaves_r 32 832 386 S 32 832 386 S 503.bwaves_r 32 832 386 * 32 832 386 * 503.bwaves_r 32 834 385 S 32 834 385 S 507.cactuBSSN_r 32 164 247 S 32 168 241 * 507.cactuBSSN_r 32 170 239 S 32 166 244 S 507.cactuBSSN_r 32 169 240 * 32 168 241 S 508.namd_r 32 279 109 S 32 278 109 * 508.namd_r 32 280 109 * 32 278 109 S 508.namd_r 32 280 109 S 32 278 110 S 510.parest_r 32 382 219 S 32 382 219 S 510.parest_r 32 380 220 * 32 380 220 * 510.parest_r 32 379 221 S 32 379 221 S 511.povray_r 32 445 168 * 32 445 168 * 511.povray_r 32 444 168 S 32 444 168 S 511.povray_r 32 445 168 S 32 445 168 S 519.lbm_r 32 478 70.6 S 32 386 87.3 * 519.lbm_r 32 477 70.7 * 32 386 87.4 S 519.lbm_r 32 465 72.6 S 32 386 87.3 S 521.wrf_r 32 339 211 * 32 320 224 S 521.wrf_r 32 341 210 S 32 321 223 S 521.wrf_r 32 338 212 S 32 320 224 * 526.blender_r 32 269 181 S 32 269 181 S 526.blender_r 32 270 180 S 32 270 180 S 526.blender_r 32 270 181 * 32 270 181 * 527.cam4_r 32 337 166 S 32 336 166 S 527.cam4_r 32 341 164 S 32 337 166 * 527.cam4_r 32 340 165 * 32 339 165 S 538.imagick_r 32 139 574 S 32 136 586 * 538.imagick_r 32 138 575 S 32 135 588 S 538.imagick_r 32 138 575 * 32 136 583 S 544.nab_r 32 257 210 * 32 254 212 S 544.nab_r 32 258 209 S 32 256 210 S 544.nab_r 32 256 211 S 32 254 212 * 549.fotonik3d_r 32 1001 125 S 32 993 126 S 549.fotonik3d_r 32 992 126 S 32 983 127 S 549.fotonik3d_r 32 994 125 * 32 985 127 * 554.roms_r 32 349 146 S 32 331 154 * 554.roms_r 32 350 145 S 32 332 153 S 554.roms_r 32 349 146 * 32 328 155 S ================================================================================= 503.bwaves_r 32 832 386 * 32 832 386 * 507.cactuBSSN_r 32 169 240 * 32 168 241 * 508.namd_r 32 280 109 * 32 278 109 * 510.parest_r 32 380 220 * 32 380 220 * 511.povray_r 32 445 168 * 32 445 168 * 519.lbm_r 32 477 70.7 * 32 386 87.3 * 521.wrf_r 32 339 211 * 32 320 224 * 526.blender_r 32 270 181 * 32 270 181 * 527.cam4_r 32 340 165 * 32 337 166 * 538.imagick_r 32 138 575 * 32 136 586 * 544.nab_r 32 257 210 * 32 254 212 * 549.fotonik3d_r 32 994 125 * 32 985 127 * 554.roms_r 32 349 146 * 32 331 154 * SPECrate(R)2017_fp_base 188 SPECrate(R)2017_fp_peak 194 HARDWARE -------- CPU Name: AMD EPYC 7373X Max MHz: 3800 Nominal: 3050 Enabled: 16 cores, 1 chip, 2 threads/core Orderable: 1 chip Cache L1: 32 KB I + 32 KB D on chip per core L2: 512 KB I+D on chip per core L3: 768 MB I+D on chip per chip, 96 MB shared / 2 cores Other: None Memory: 1 TB (8 x 128 GB 4Rx4 PC4-3200AA-L) Storage: 1 x 480 GB SATA SSD, RAID 0 Other: None SOFTWARE -------- OS: Ubuntu 20.04.3 LTS Kernel 5.13.0-28-generic Compiler: C/C++/Fortran: Version 3.2.0 of AOCC Parallel: No Firmware: HPE BIOS Version A43 v2.56 02/10/2022 released Feb-2022 File System: ext4 System State: Run level 5 (multi-user) Base Pointers: 64-bit Peak Pointers: 64-bit Other: jemalloc: jemalloc memory allocator library v5.1.0 Power Management: BIOS set to prefer performance at the cost of additional power usage Compiler Notes -------------- The AMD64 AOCC Compiler Suite is available at http://developer.amd.com/amd-aocc/ Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details. Operating System Notes ---------------------- 'ulimit -s unlimited' was used to set environment stack size limit 'ulimit -l 2097152' was used to set environment locked pages in memory limit runcpu command invoked through numactl i.e.: numactl --interleave=all runcpu To limit dirty cache to 8% of memory, 'sysctl -w vm.dirty_ratio=8' run as root. To limit swap usage to minimum necessary, 'sysctl -w vm.swappiness=1' run as root. To free node-local memory and avoid remote memory usage, 'sysctl -w vm.zone_reclaim_mode=1' run as root. To clear filesystem caches, 'sync; sysctl -w vm.drop_caches=3' run as root. To disable address space layout randomization (ASLR) to reduce run-to-run variability, 'sysctl -w kernel.randomize_va_space=0' run as root. To enable Transparent Hugepages (THP) for all allocations, 'echo always > /sys/kernel/mm/transparent_hugepage/enabled' and 'echo always > /sys/kernel/mm/transparent_hugepage/defrag' run as root. Environment Variables Notes --------------------------- Environment variables set by runcpu before the start of the run: LD_LIBRARY_PATH = "/home/oem/cpu2017/amd_rate_aocc320_milanx_A_lib/lib;/home/oem/cpu2017/a md_rate_aocc320_milanx_A_lib/lib32:" MALLOC_CONF = "retain:true" General Notes ------------- Binaries were compiled on a system with 2x AMD EPYC 7742 CPU + 1TiB Memory using OpenSUSE 15.2 NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown) is mitigated in the system as tested and documented. Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1) is mitigated in the system as tested and documented. Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2) is mitigated in the system as tested and documented. jemalloc: configured and built with GCC v4.8.2 in RHEL 7.4 (No options specified) jemalloc 5.1.0 is available here: https://github.com/jemalloc/jemalloc/releases/download/5.1.0/jemalloc-5.1.0.tar.bz2 Platform Notes -------------- BIOS Configuration Workload Profile set to General Throughput Compute Determinism Control set to Manual Performance Determinism set to Power Deterministic Memory Interleaving Mode set to Disabled Last-Level Cache (LLC) as NUMA Node set to Enabled NUMA memory domains per socket set to Four memory domains per socket Infinity Fabric Power Management set to Disabled Infinity Fabric Performance State set to P0 Thermal Configuration set to Maximum Cooling Workload Profile set to Custom L2 HW Prefetcher set to Disabled The system date and time as discovered by sysinfo is incorrect as the time was not updated prior to the run. The test_date field shows an accurate date for the result. The system ROM used for this result contains microcode version 0x 0A001227h for the AMD EPYC 7nn3X family of processors. The reference code/AGESA version used in this ROM is version MilanPI 1.0.0.8. Sysinfo program /home/oem/cpu2017/bin/sysinfo Rev: r6622 of 2021-04-07 982a61ec0915b55891ef0e16acafc64d running on oem-dl325Gen10Plus Mon Jan 10 15:45:50 2022 SUT (System Under Test) info as seen by some common utilities. For more information on this section, see https://www.spec.org/cpu2017/Docs/config.html#sysinfo From /proc/cpuinfo model name : AMD EPYC 7373X 16-Core Processor 1 "physical id"s (chips) 32 "processors" cores, siblings (Caution: counting these is hw and system dependent. The following excerpts from /proc/cpuinfo might not be reliable. Use with caution.) cpu cores : 16 siblings : 32 physical 0: cores 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 From lscpu from util-linux 2.34: Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Byte Order: Little Endian Address sizes: 48 bits physical, 48 bits virtual CPU(s): 32 On-line CPU(s) list: 0-31 Thread(s) per core: 2 Core(s) per socket: 16 Socket(s): 1 NUMA node(s): 8 Vendor ID: AuthenticAMD CPU family: 25 Model: 1 Model name: AMD EPYC 7373X 16-Core Processor Stepping: 2 CPU MHz: 3044.249 BogoMIPS: 6088.49 Virtualization: AMD-V L1d cache: 512 KiB L1i cache: 512 KiB L2 cache: 8 MiB L3 cache: 768 MiB NUMA node0 CPU(s): 0,1,16,17 NUMA node1 CPU(s): 2,3,18,19 NUMA node2 CPU(s): 4,5,20,21 NUMA node3 CPU(s): 6,7,22,23 NUMA node4 CPU(s): 8,9,24,25 NUMA node5 CPU(s): 10,11,26,27 NUMA node6 CPU(s): 12,13,28,29 NUMA node7 CPU(s): 14,15,30,31 Vulnerability Itlb multihit: Not affected Vulnerability L1tf: Not affected Vulnerability Mds: Not affected Vulnerability Meltdown: Not affected Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl and seccomp Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization Vulnerability Spectre v2: Mitigation; Full AMD retpoline, IBPB conditional, IBRS_FW, STIBP always-on, RSB filling Vulnerability Srbds: Not affected Vulnerability Tsx async abort: Not affected Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 invpcid_single hw_pstate ssbd mba ibrs ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 invpcid cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd amd_ppin arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload vgif v_spec_ctrl umip pku ospke vaes vpclmulqdq rdpid overflow_recov succor smca From lscpu --cache: NAME ONE-SIZE ALL-SIZE WAYS TYPE LEVEL L1d 32K 512K 8 Data 1 L1i 32K 512K 8 Instruction 1 L2 512K 8M 8 Unified 2 L3 96M 768M 16 Unified 3 /proc/cpuinfo cache data cache size : 512 KB From numactl --hardware WARNING: a numactl 'node' might or might not correspond to a physical chip. available: 8 nodes (0-7) node 0 cpus: 0 1 16 17 node 0 size: 128712 MB node 0 free: 128137 MB node 1 cpus: 2 3 18 19 node 1 size: 129020 MB node 1 free: 128915 MB node 2 cpus: 4 5 20 21 node 2 size: 129022 MB node 2 free: 128870 MB node 3 cpus: 6 7 22 23 node 3 size: 129021 MB node 3 free: 128775 MB node 4 cpus: 8 9 24 25 node 4 size: 129022 MB node 4 free: 128857 MB node 5 cpus: 10 11 26 27 node 5 size: 129021 MB node 5 free: 128783 MB node 6 cpus: 12 13 28 29 node 6 size: 129022 MB node 6 free: 128841 MB node 7 cpus: 14 15 30 31 node 7 size: 128975 MB node 7 free: 128826 MB node distances: node 0 1 2 3 4 5 6 7 0: 10 11 12 12 12 12 12 12 1: 11 10 12 12 12 12 12 12 2: 12 12 10 11 12 12 12 12 3: 12 12 11 10 12 12 12 12 4: 12 12 12 12 10 11 12 12 5: 12 12 12 12 11 10 12 12 6: 12 12 12 12 12 12 10 11 7: 12 12 12 12 12 12 11 10 From /proc/meminfo MemTotal: 1056584228 kB HugePages_Total: 0 Hugepagesize: 2048 kB /sbin/tuned-adm active Current active profile: throughput-performance /usr/bin/lsb_release -d Ubuntu 20.04.3 LTS From /etc/*release* /etc/*version* debian_version: bullseye/sid os-release: NAME="Ubuntu" VERSION="20.04.3 LTS (Focal Fossa)" ID=ubuntu ID_LIKE=debian PRETTY_NAME="Ubuntu 20.04.3 LTS" VERSION_ID="20.04" HOME_URL="https://www.ubuntu.com/" SUPPORT_URL="https://help.ubuntu.com/" uname -a: Linux oem-dl325Gen10Plus 5.13.0-28-generic #31~20.04.1-Ubuntu SMP Wed Jan 19 14:08:10 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux Kernel self-reported vulnerability status: CVE-2018-12207 (iTLB Multihit): Not affected CVE-2018-3620 (L1 Terminal Fault): Not affected Microarchitectural Data Sampling: Not affected CVE-2017-5754 (Meltdown): Not affected CVE-2018-3639 (Speculative Store Bypass): Mitigation: Speculative Store Bypass disabled via prctl and seccomp CVE-2017-5753 (Spectre variant 1): Mitigation: usercopy/swapgs barriers and __user pointer sanitization CVE-2017-5715 (Spectre variant 2): Mitigation: Full AMD retpoline, IBPB: conditional, IBRS_FW, STIBP: always-on, RSB filling CVE-2020-0543 (Special Register Buffer Data Sampling): Not affected CVE-2019-11135 (TSX Asynchronous Abort): Not affected run-level 5 Jan 10 10:27 SPEC is set to: /home/oem/cpu2017 Filesystem Type Size Used Avail Use% Mounted on /dev/sda2 ext4 439G 17G 401G 4% / From /sys/devices/virtual/dmi/id Vendor: HPE Product: ProLiant DL325 Gen10 Plus v2 Product Family: ProLiant Serial: CN70381LLR Additional information from dmidecode 3.2 follows. WARNING: Use caution when you interpret this section. The 'dmidecode' program reads system data which is "intended to allow hardware to be accurately determined", but the intent may not be met, as there are frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard. Memory: 8x Hynix HMABAGL7ABR4N-XN 128 GB 4 rank 3200 8x UNKNOWN NOT AVAILABLE BIOS: BIOS Vendor: HPE BIOS Version: A43 BIOS Date: 02/10/2022 BIOS Revision: 2.56 Firmware Revision: 2.60 (End of data from sysinfo program) Compiler Version Notes ---------------------- ============================================================================== C | 519.lbm_r(base, peak) 538.imagick_r(base, peak) | 544.nab_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin ------------------------------------------------------------------------------ ============================================================================== C++ | 508.namd_r(base, peak) 510.parest_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin ------------------------------------------------------------------------------ ============================================================================== C++, C | 511.povray_r(base, peak) 526.blender_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin ------------------------------------------------------------------------------ ============================================================================== C++, C, Fortran | 507.cactuBSSN_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin ------------------------------------------------------------------------------ ============================================================================== Fortran | 503.bwaves_r(base, peak) 549.fotonik3d_r(base, peak) | 554.roms_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin ------------------------------------------------------------------------------ ============================================================================== Fortran, C | 521.wrf_r(base, peak) 527.cam4_r(base, peak) ------------------------------------------------------------------------------ AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin AMD clang version 13.0.0 (CLANG: AOCC_3.2.0-Build#128 2021_11_12) (based on LLVM Mirror.Version.13.0.0) Target: x86_64-unknown-linux-gnu Thread model: posix InstalledDir: /opt/AMD/aocc-compiler-3.2.0/bin ------------------------------------------------------------------------------ Base Compiler Invocation ------------------------ C benchmarks: clang C++ benchmarks: clang++ Fortran benchmarks: flang Benchmarks using both Fortran and C: flang clang Benchmarks using both C and C++: clang++ clang Benchmarks using Fortran, C, and C++: clang++ clang flang Base Portability Flags ---------------------- 503.bwaves_r: -DSPEC_LP64 507.cactuBSSN_r: -DSPEC_LP64 508.namd_r: -DSPEC_LP64 510.parest_r: -DSPEC_LP64 511.povray_r: -DSPEC_LP64 519.lbm_r: -DSPEC_LP64 521.wrf_r: -DSPEC_CASE_FLAG -Mbyteswapio -DSPEC_LP64 526.blender_r: -funsigned-char -D__BOOL_DEFINED -DSPEC_LP64 527.cam4_r: -DSPEC_CASE_FLAG -DSPEC_LP64 538.imagick_r: -DSPEC_LP64 544.nab_r: -DSPEC_LP64 549.fotonik3d_r: -DSPEC_LP64 554.roms_r: -DSPEC_LP64 Base Optimization Flags ----------------------- C benchmarks: -m64 -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Wl,-mllvm -Wl,-enable-loop-fusion -O3 -march=znver3 -fveclib=AMDLIBM -ffast-math -fstruct-layout=5 -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000 -fremap-arrays -mllvm -function-specialize -flv-function-specialization -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -enable-loop-fusion -z muldefs -lamdlibm -ljemalloc -lflang C++ benchmarks: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Wl,-mllvm -Wl,-enable-loop-fusion -O3 -march=znver3 -fveclib=AMDLIBM -ffast-math -mllvm -enable-partial-unswitch -mllvm -unroll-threshold=100 -finline-aggressive -flv-function-specialization -mllvm -loop-unswitch-threshold=200000 -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -mllvm -convert-pow-exp-to-int=false -mllvm -enable-loop-fusion -z muldefs -lamdlibm -ljemalloc -lflang Fortran benchmarks: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Wl,-mllvm -Wl,-enable-loop-fusion -Hz,1,0x1 -O3 -march=znver3 -fveclib=AMDLIBM -ffast-math -Kieee -Mrecursive -mllvm -fuse-tile-inner-loop -funroll-loops -mllvm -extra-vectorizer-passes -mllvm -lsr-in-nested-loop -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -mllvm -enable-loop-fusion -mllvm -enable-loopinterchange -mllvm -compute-interchange-order -z muldefs -lamdlibm -ljemalloc -lflang Benchmarks using both Fortran and C: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Wl,-mllvm -Wl,-enable-loop-fusion -O3 -march=znver3 -fveclib=AMDLIBM -ffast-math -fstruct-layout=5 -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000 -fremap-arrays -mllvm -function-specialize -flv-function-specialization -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -enable-loop-fusion -Hz,1,0x1 -Kieee -Mrecursive -mllvm -fuse-tile-inner-loop -funroll-loops -mllvm -extra-vectorizer-passes -mllvm -lsr-in-nested-loop -mllvm -enable-loopinterchange -mllvm -compute-interchange-order -z muldefs -lamdlibm -ljemalloc -lflang Benchmarks using both C and C++: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Wl,-mllvm -Wl,-enable-loop-fusion -O3 -march=znver3 -fveclib=AMDLIBM -ffast-math -fstruct-layout=5 -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000 -fremap-arrays -mllvm -function-specialize -flv-function-specialization -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -enable-loop-fusion -mllvm -enable-partial-unswitch -mllvm -unroll-threshold=100 -finline-aggressive -mllvm -loop-unswitch-threshold=200000 -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes -mllvm -convert-pow-exp-to-int=false -z muldefs -lamdlibm -ljemalloc -lflang Benchmarks using Fortran, C, and C++: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Wl,-mllvm -Wl,-enable-loop-fusion -O3 -march=znver3 -fveclib=AMDLIBM -ffast-math -fstruct-layout=5 -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000 -fremap-arrays -mllvm -function-specialize -flv-function-specialization -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -enable-loop-fusion -mllvm -enable-partial-unswitch -mllvm -unroll-threshold=100 -finline-aggressive -mllvm -loop-unswitch-threshold=200000 -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes -mllvm -convert-pow-exp-to-int=false -Hz,1,0x1 -Kieee -Mrecursive -mllvm -fuse-tile-inner-loop -funroll-loops -mllvm -lsr-in-nested-loop -mllvm -enable-loopinterchange -mllvm -compute-interchange-order -z muldefs -lamdlibm -ljemalloc -lflang Base Other Flags ---------------- C benchmarks: -Wno-unused-command-line-argument C++ benchmarks: -Wno-unused-command-line-argument Fortran benchmarks: -Wno-unused-command-line-argument Benchmarks using both Fortran and C: -Wno-unused-command-line-argument Benchmarks using both C and C++: -Wno-unused-command-line-argument Benchmarks using Fortran, C, and C++: -Wno-unused-command-line-argument Peak Compiler Invocation ------------------------ C benchmarks: clang C++ benchmarks: clang++ Fortran benchmarks: flang Benchmarks using both Fortran and C: flang clang Benchmarks using both C and C++: clang++ clang Benchmarks using Fortran, C, and C++: clang++ clang flang Peak Portability Flags ---------------------- Same as Base Portability Flags Peak Optimization Flags ----------------------- C benchmarks: 519.lbm_r: -m64 -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -ffast-math -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -lamdlibm -ljemalloc 538.imagick_r: Same as 519.lbm_r 544.nab_r: -m64 -flto -Wl,-mllvm -Wl,-region-vectorize -Wl,-mllvm -Wl,-function-specialize -Ofast -march=znver3 -fveclib=AMDLIBM -ffast-math -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -lamdlibm -ljemalloc C++ benchmarks: 508.namd_r: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -ffast-math -finline-aggressive -mllvm -unroll-threshold=100 -flv-function-specialization -mllvm -enable-licm-vrp -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -lamdlibm -ljemalloc 510.parest_r: basepeak = yes Fortran benchmarks: 503.bwaves_r: basepeak = yes 549.fotonik3d_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -ffast-math -Kieee -Mrecursive -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -lamdlibm -ljemalloc -lflang 554.roms_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -ffast-math -Mrecursive -mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp -Hz,1,0x1 -mllvm -fuse-tile-inner-loop -lamdlibm -ljemalloc -lflang Benchmarks using both Fortran and C: 521.wrf_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -ffast-math -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -Mrecursive -lamdlibm -ljemalloc -lflang 527.cam4_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-force-vector-interleave=1 -Ofast -march=znver3 -fveclib=AMDLIBM -ffast-math -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -O3 -funroll-loops -mllvm -extra-vectorizer-passes -mllvm -lsr-in-nested-loop -Mrecursive -Hz,1,0x1 -mllvm -enable-loopinterchange -mllvm -compute-interchange-order -lamdlibm -ljemalloc -lflang Benchmarks using both C and C++: 511.povray_r: basepeak = yes 526.blender_r: basepeak = yes Benchmarks using Fortran, C, and C++: -m64 -std=c++98 -mno-adx -mno-sse4a -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -Wl,-mllvm -Wl,-enable-licm-vrp -flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast -march=znver3 -fveclib=AMDLIBM -ffast-math -fstruct-layout=7 -mllvm -unroll-threshold=50 -fremap-arrays -flv-function-specialization -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true -mllvm -function-specialize -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3 -mllvm -unroll-threshold=100 -mllvm -loop-unswitch-threshold=200000 -finline-aggressive -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes -mllvm -convert-pow-exp-to-int=false -Mrecursive -lamdlibm -ljemalloc -lflang Peak Other Flags ---------------- C benchmarks: -Wno-unused-command-line-argument C++ benchmarks: -Wno-unused-command-line-argument Fortran benchmarks: -Wno-unused-command-line-argument Benchmarks using both Fortran and C: -Wno-unused-command-line-argument Benchmarks using both C and C++: -Wno-unused-command-line-argument Benchmarks using Fortran, C, and C++: -Wno-unused-command-line-argument The flags files that were used to format this result can be browsed at http://www.spec.org/cpu2017/flags/HPE-Platform-Flags-AMD-V1.2-EPYC-revR.html http://www.spec.org/cpu2017/flags/aocc320-flags-A1.html You can also download the XML flags sources by saving the following links: http://www.spec.org/cpu2017/flags/HPE-Platform-Flags-AMD-V1.2-EPYC-revR.xml http://www.spec.org/cpu2017/flags/aocc320-flags-A1.xml SPEC CPU and SPECrate are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ---------------------------------------------------------------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact info@spec.org. Copyright 2017-2022 Standard Performance Evaluation Corporation Tested with SPEC CPU(R)2017 v1.1.8 on 2022-01-10 05:15:49-0500. Report generated on 2022-03-22 10:12:36 by CPU2017 text formatter v6255. Originally published on 2022-03-21.