SPEC CPU®2017 Floating Point Rate Result

Copyright 2017-2021 Standard Performance Evaluation Corporation

Tyrone Systems (Test Sponsor: Netweb Pte Ltd)

Tyrone Camarero SSA300TA-54TL
(3.90 GHz, AMD Ryzen Threadripper PRO 3955WX)

SPECrate®2017_fp_base = 115

SPECrate®2017_fp_peak = 125

CPU2017 License:       006042
Test Sponsor:          Netweb Pte Ltd
Tested by:             Tyrone Systems
Test Date:             Oct-2021
Hardware Availability: Mar-2021
Software Availability: Aug-2020

Benchmark result graphs are available in the PDF report.

Hardware
CPU Name: AMD Ryzen Threadripper PRO 3955WX
  Max MHz: 4300
  Nominal: 3900
Enabled: 16 cores, 1 chip, 2 threads/core
Orderable: 1 chip
Cache L1: 32 KB I + 32 KB D on chip per core
  L2: 512 KB I+D on chip per core
  L3: 64 MB I+D on chip per chip, 16 MB shared / 4 cores
  Other: None
Memory: 256 GB (8 x 32 GB 2Rx4 PC4-2933P-R)
Storage: 1 x 480 GB SATA SSD
Other: None
Software
OS: Red Hat Enterprise Linux 8.0 (Ootpa)
kernel version 4.18.0-80.el8.x86_64
Compiler: C/C++/Fortran: Version 2.0.0 of AOCC
Parallel: No
Firmware: Version 1.0c released Sep-2021
File System: xfs
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 64-bit
Other: jemalloc: jemalloc memory allocator library v5.2.0
Power Management: BIOS set to prefer performance at the cost of additional power usage.

Results Table

                  Base                                                    Peak
Benchmark         Copies  Seconds  Ratio  Seconds  Ratio  Seconds  Ratio  Copies  Seconds  Ratio  Seconds  Ratio  Seconds  Ratio
503.bwaves_r          32     1814    177     1813    177     1814    177       8      429    187      429    187      429    187
507.cactuBSSN_r       32      232    175      231    176      231    175      32      232    175      230    176      230    176
508.namd_r            32      284    107      281    108      283    107      32      298    102      296    103      290    105
510.parest_r          32     1596   52.4     1604   52.2     1596   52.5      16      449   93.2      448   93.3      449   93.2
511.povray_r          32      476    157      477    157      475    157      32      468    160      469    159      476    157
519.lbm_r             32      704   47.9      705   47.8      706   47.8      32      703   48.0      705   47.8      703   48.0
521.wrf_r             32      662    108      661    109      662    108      16      289    124      288    124      289    124
526.blender_r         32      322    151      324    150      325    150      32      328    149      325    150      327    149
527.cam4_r            32      430    130      431    130      430    130      32      431    130      431    130      433    129
538.imagick_r         32      178    447      177    449      177    449      32      177    450      176    452      178    447
544.nab_r             32      272    198      272    198      272    198      32      272    198      273    197      275    196
549.fotonik3d_r       32     1907   65.4     1907   65.4     1907   65.4      16      940   66.4      939   66.4      940   66.4
554.roms_r            32     1351   37.6     1348   37.7     1353   37.6      16      488   52.0      486   52.4      487   52.2
SPECrate®2017_fp_base  115
SPECrate®2017_fp_peak  125
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
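
As a cross-check on the table, a rate ratio can be recomputed as copies x reference time / run time, and the overall metric as the geometric mean of the per-benchmark median ratios. The sketch below does this for the base runs. The reference times are an assumption: they are not listed in this report and are taken from SPEC's published SPECrate 2017 floating point reference run times. The function names are illustrative only, and because the table rounds run times to whole seconds, recomputed ratios may differ from the reported ones in the last digit.

```python
import math
import statistics

# Assumed SPECrate 2017 FP reference times in seconds (not listed in this report).
REFERENCE_SECONDS = {
    "503.bwaves_r": 10030, "507.cactuBSSN_r": 1266, "508.namd_r": 950,
    "510.parest_r": 2616, "511.povray_r": 2335, "519.lbm_r": 1054,
    "521.wrf_r": 2240, "526.blender_r": 1523, "527.cam4_r": 1749,
    "538.imagick_r": 2487, "544.nab_r": 1683, "549.fotonik3d_r": 3897,
    "554.roms_r": 1589,
}

# Base runs from the results table: (copies, three run times in seconds).
BASE_RUNS = {
    "503.bwaves_r": (32, [1814, 1813, 1814]),
    "507.cactuBSSN_r": (32, [232, 231, 231]),
    "508.namd_r": (32, [284, 281, 283]),
    "510.parest_r": (32, [1596, 1604, 1596]),
    "511.povray_r": (32, [476, 477, 475]),
    "519.lbm_r": (32, [704, 705, 706]),
    "521.wrf_r": (32, [662, 661, 662]),
    "526.blender_r": (32, [322, 324, 325]),
    "527.cam4_r": (32, [430, 431, 430]),
    "538.imagick_r": (32, [178, 177, 177]),
    "544.nab_r": (32, [272, 272, 272]),
    "549.fotonik3d_r": (32, [1907, 1907, 1907]),
    "554.roms_r": (32, [1351, 1348, 1353]),
}


def rate_ratio(name, copies, seconds):
    """Rate ratio for one run: copies * reference time / elapsed time."""
    return copies * REFERENCE_SECONDS[name] / seconds


def overall_rate(runs):
    """Geometric mean of the ratios at each benchmark's median run time."""
    ratios = [
        rate_ratio(name, copies, statistics.median(times))
        for name, (copies, times) in runs.items()
    ]
    return math.exp(sum(math.log(r) for r in ratios) / len(ratios))


if __name__ == "__main__":
    print(round(rate_ratio("519.lbm_r", 32, 704), 1))
    print(round(overall_rate(BASE_RUNS)))
```

The same two functions apply to the peak columns once the per-benchmark copy counts (8, 16 or 32) are substituted.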

Compiler Notes

The AMD64 AOCC Compiler Suite is available at
http://developer.amd.com/amd-aocc/

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.
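
The actual submit line lives in the configuration file, which is not reproduced in this report. As a rough illustration only, the sketch below builds one `numactl` command per copy, pinning copy N to logical CPU N. `--physcpubind` and `--localalloc` are real `numactl` options, but their use here, the function name, and the `./benchmark` placeholder are assumptions rather than the sponsor's settings.

```python
def bind_commands(copies, command, cpus=None):
    """Return one numactl-prefixed command line per benchmark copy."""
    # Assumption: copy N is pinned to logical CPU N unless a CPU list is given.
    cpus = list(range(copies)) if cpus is None else list(cpus)
    if copies > len(cpus):
        raise ValueError("more copies than CPUs to bind to")
    return [
        f"numactl --localalloc --physcpubind={cpus[i]} {command}"
        for i in range(copies)
    ]


if __name__ == "__main__":
    # Hypothetical placeholder command; 32 copies matches the base runs.
    for line in bind_commands(32, "./benchmark")[:2]:
        print(line)
```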

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152' was used to set environment locked pages in memory limit
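
For readers who want to inspect the same limits from a script, the following sketch reads the current stack and locked-memory limits and converts the `ulimit -l` value to bytes. It assumes the shell reports `ulimit -l` in 1024-byte units (as bash does), so 2097152 corresponds to 2 GiB. The helper names are illustrative.

```python
import resource

KIB = 1024


def memlock_limit_bytes(ulimit_l_kib):
    """Convert a 'ulimit -l' value (assumed 1024-byte units) to bytes."""
    return ulimit_l_kib * KIB


def current_limits():
    """Return the (soft, hard) stack and locked-memory limits of this process."""
    return {
        "stack": resource.getrlimit(resource.RLIMIT_STACK),
        "memlock": resource.getrlimit(resource.RLIMIT_MEMLOCK),
    }


def is_unlimited(value):
    """True if a limit value means 'unlimited'."""
    return value == resource.RLIM_INFINITY


if __name__ == "__main__":
    print(memlock_limit_bytes(2097152))
    for name, (soft, hard) in current_limits().items():
        print(name, "unlimited" if is_unlimited(soft) else soft)
```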

runcpu command invoked through numactl i.e.:
numactl --interleave=all runcpu <etc>

Set dirty_ratio=8 to limit dirty cache to 8% of memory
Set swappiness=1 to swap only if necessary
Set zone_reclaim_mode=1 to free local node memory and avoid remote memory
sync then drop_caches=3 to reset caches before invoking runcpu

dirty_ratio, swappiness, zone_reclaim_mode and drop_caches were
all set using privileged echo (e.g. echo 1 > /proc/sys/vm/swappiness).
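
The settings above can be sketched as a small script that writes the same values under `/proc/sys/vm`. This is only an illustration of the sequence described in the notes (tunables first, then `sync`, then `drop_caches`); the function name and the `vm_dir` parameter are assumptions, and writing to the real directory requires root.

```python
import os
from pathlib import Path

# Values taken from the notes above.
VM_SETTINGS = {"dirty_ratio": "8", "swappiness": "1", "zone_reclaim_mode": "1"}


def apply_vm_settings(vm_dir="/proc/sys/vm", drop_caches=True):
    """Write the VM tunables, then sync and drop caches; return what was written."""
    vm = Path(vm_dir)
    written = []
    for name, value in VM_SETTINGS.items():
        (vm / name).write_text(value + "\n")
        written.append((name, value))
    if drop_caches:
        os.sync()  # flush dirty pages before dropping caches
        (vm / "drop_caches").write_text("3\n")
        written.append(("drop_caches", "3"))
    return written
```

Pointing `vm_dir` at a scratch directory allows a dry run without touching the running system.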

Transparent huge pages set to 'always' for this run (OS default)
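
The active transparent huge page mode can be read back from sysfs, where the kernel marks the selected value with square brackets. The sketch below parses that format; the helper names are illustrative and the path is the usual Linux location, assumed rather than quoted from this report.

```python
from pathlib import Path

THP_ENABLED = "/sys/kernel/mm/transparent_hugepage/enabled"


def active_thp_mode(text):
    """Return the bracketed (active) mode from the 'enabled' file content."""
    for word in text.split():
        if word.startswith("[") and word.endswith("]"):
            return word[1:-1]
    return None


def read_thp_mode(path=THP_ENABLED):
    """Read and parse the sysfs file; raises OSError if it does not exist."""
    return active_thp_mode(Path(path).read_text())
```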

Environment Variables Notes

Environment variables set by runcpu before the start of the run:
MALLOC_CONF = "retain:true"
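
In this result runcpu sets the variable itself. For a process launched by hand, an equivalent setup might look like the sketch below, which passes `MALLOC_CONF` to a child process. The helper name and the stand-in child command are assumptions; a real run would launch a binary linked against jemalloc, which reads the variable at startup.

```python
import os
import subprocess
import sys


def run_with_malloc_conf(argv, malloc_conf="retain:true"):
    """Run a command with MALLOC_CONF set in its environment."""
    env = dict(os.environ, MALLOC_CONF=malloc_conf)
    return subprocess.run(argv, env=env, capture_output=True, text=True, check=True)


if __name__ == "__main__":
    # Stand-in child process that only echoes the variable it received.
    child = [sys.executable, "-c", "import os; print(os.environ['MALLOC_CONF'])"]
    print(run_with_malloc_conf(child).stdout.strip())
```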

General Notes

Binaries were compiled on a system with 1 x AMD EPYC 3955WX CPU + 512GB Memory using Red Hat Enterprise Linux 8.0

NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown)
is mitigated in the system as tested and documented.
Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1)
is mitigated in the system as tested and documented.
Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2)
is mitigated in the system as tested and documented.

jemalloc: configured and built with GCC v9.1.0 in Ubuntu 19.04 with -O3 -march=znver2 -flto
jemalloc 5.2.0 is available here:
https://github.com/jemalloc/jemalloc/releases/download/5.2.0/jemalloc-5.2.0.tar.bz2

Platform Notes

BIOS Settings:
Determinism Control set to Manual
Determinism Slider set to Power
cTDP Control set to Manual
cTDP set to 280
Package Power Limit Control set to Manual
Package Power Limit set to 280
APBDIS set to 1
NUMA Nodes Per Socket set to Auto

 Sysinfo program /home/cpu2017/bin/sysinfo
 Rev: r6622 of 2021-04-07 982a61ec0915b55891ef0e16acafc64d
 running on spec Sun Oct 10 00:56:41 2021

 SUT (System Under Test) info as seen by some common utilities.
 For more information on this section, see
    https://www.spec.org/cpu2017/Docs/config.html#sysinfo

 From /proc/cpuinfo
    model name : AMD Ryzen Threadripper PRO 3955WX 16-Cores
       1  "physical id"s (chips)
       32 "processors"
    cores, siblings (Caution: counting these is hw and system dependent. The following
    excerpts from /proc/cpuinfo might not be reliable.  Use with caution.)
       cpu cores : 16
       siblings  : 32
       physical 0: cores 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
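
The counts above can be approximated by scanning `/proc/cpuinfo` for `processor`, `physical id` and `cpu cores` entries. The sketch below is not the sysinfo implementation, only a minimal illustration of that kind of tally; the function name is an assumption.

```python
def summarize_cpuinfo(text):
    """Count logical processors and distinct physical ids in /proc/cpuinfo text."""
    processors = 0
    physical_ids = set()
    cpu_cores = None
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # blank separator lines between processors
        key, value = key.strip(), value.strip()
        if key == "processor":
            processors += 1
        elif key == "physical id":
            physical_ids.add(value)
        elif key == "cpu cores":
            cpu_cores = int(value)
    return {"processors": processors, "chips": len(physical_ids), "cpu_cores": cpu_cores}


if __name__ == "__main__":
    with open("/proc/cpuinfo") as handle:
        print(summarize_cpuinfo(handle.read()))
```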

 From lscpu from util-linux 2.32.1:
      Architecture:        x86_64
      CPU op-mode(s):      32-bit, 64-bit
      Byte Order:          Little Endian
      CPU(s):              32
      On-line CPU(s) list: 0-31
      Thread(s) per core:  2
      Core(s) per socket:  16
      Socket(s):           1
      NUMA node(s):        1
      Vendor ID:           AuthenticAMD
      CPU family:          23
      Model:               49
      Model name:          AMD Ryzen Threadripper PRO 3955WX 16-Cores
      Stepping:            0
      CPU MHz:             3862.860
      CPU max MHz:         3900.0000
      CPU min MHz:         2200.0000
      BogoMIPS:            7800.23
      Virtualization:      AMD-V
      L1d cache:           32K
      L1i cache:           32K
      L2 cache:            512K
      L3 cache:            16384K
      NUMA node0 CPU(s):   0-31
      Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
      pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm
      constant_tsc rep_good nopl xtopology nonstop_tsc cpuid extd_apicid aperfmperf pni
      pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c
      rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch
      osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb
      cat_l3 cdp_l3 hw_pstate sme ssbd sev ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2
      cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves
      cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr wbnoinvd
      arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists
      pausefilter pfthreshold avic v_vmsave_vmload vgif umip rdpid overflow_recov succor
      smca

 /proc/cpuinfo cache data
    cache size : 512 KB

 From numactl --hardware
 WARNING: a numactl 'node' might or might not correspond to a physical chip.
   available: 1 nodes (0)
   node 0 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
   28 29 30 31
   node 0 size: 257802 MB
   node 0 free: 252331 MB
   node distances:
   node   0
     0:  10

 From /proc/meminfo
    MemTotal:       263989408 kB
    HugePages_Total:    2048
    Hugepagesize:       2048 kB

 /sbin/tuned-adm active
     Current active profile: throughput-performance

 /sys/devices/system/cpu/cpu*/cpufreq/scaling_governor has
    performance

 From /etc/*release* /etc/*version*
    os-release:
       NAME="Red Hat Enterprise Linux"
       VERSION="8.0 (Ootpa)"
       ID="rhel"
       ID_LIKE="fedora"
       VERSION_ID="8.0"
       PLATFORM_ID="platform:el8"
       PRETTY_NAME="Red Hat Enterprise Linux 8.0 (Ootpa)"
       ANSI_COLOR="0;31"
    redhat-release: Red Hat Enterprise Linux release 8.0 (Ootpa)
    system-release: Red Hat Enterprise Linux release 8.0 (Ootpa)
    system-release-cpe: cpe:/o:redhat:enterprise_linux:8.0:ga

 uname -a:
    Linux spec 4.18.0-80.el8.x86_64 #1 SMP Wed Mar 13 12:02:46 UTC 2019 x86_64 x86_64
    x86_64 GNU/Linux

 Kernel self-reported vulnerability status:

 CVE-2018-12207 (iTLB Multihit):                        No status reported
 CVE-2018-3620 (L1 Terminal Fault):                     Not affected
 Microarchitectural Data Sampling:                      No status reported
 CVE-2017-5754 (Meltdown):                              Not affected
 CVE-2018-3639 (Speculative Store Bypass):              Mitigation: Speculative Store
                                                        Bypass disabled via prctl and
                                                        seccomp
 CVE-2017-5753 (Spectre variant 1):                     Mitigation: __user pointer
                                                        sanitization
 CVE-2017-5715 (Spectre variant 2):                     Mitigation: Full AMD retpoline,
                                                        IBPB: conditional, STIBP:
                                                        conditional, RSB filling
 CVE-2020-0543 (Special Register Buffer Data Sampling): No status reported
 CVE-2019-11135 (TSX Asynchronous Abort):               No status reported
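
On Linux the kernel publishes these statuses as one file per issue under `/sys/devices/system/cpu/vulnerabilities`. A minimal sketch for reading them is shown below; the function name is illustrative. Entries listed as "No status reported" are presumably issues for which this kernel exposes no file, though the report does not say so explicitly.

```python
from pathlib import Path

VULN_DIR = "/sys/devices/system/cpu/vulnerabilities"


def read_vulnerabilities(directory=VULN_DIR):
    """Map each file name in the vulnerabilities directory to its status line."""
    root = Path(directory)
    if not root.is_dir():
        return {}
    return {
        entry.name: entry.read_text().strip()
        for entry in sorted(root.iterdir())
        if entry.is_file()
    }


if __name__ == "__main__":
    for name, status in read_vulnerabilities().items():
        print(f"{name}: {status}")
```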

 run-level 3 Oct 9 17:36

 SPEC is set to: /home/cpu2017
    Filesystem            Type  Size  Used Avail Use% Mounted on
    /dev/mapper/rhel-home xfs   392G  6.1G  386G   2% /home

 From /sys/devices/virtual/dmi/id
     Vendor:         Tyrone Systems
     Product:        SSA300TA-54TL
     Product Family: AMD
     Serial:         5O20142109

 Additional information from dmidecode 3.2 follows.  WARNING: Use caution when you
 interpret this section. The 'dmidecode' program reads system data which is "intended to
 allow hardware to be accurately determined", but the intent may not be met, as there are
 frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard.
   Memory:
     8x Samsung M393A4K40DB2-CVF 32 GB 2 rank 2933

 BIOS:
    BIOS Vendor:       American Megatrends International, LLC.
    BIOS Version:      1.0c
    BIOS Date:         09/10/2021
    BIOS Revision:     1.12

 (End of data from sysinfo program)

Compiler Version Notes

==============================================================================
C               | 519.lbm_r(base, peak) 538.imagick_r(base, peak)
                | 544.nab_r(base, peak)
------------------------------------------------------------------------------
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
------------------------------------------------------------------------------

==============================================================================
C++             | 508.namd_r(base, peak) 510.parest_r(base, peak)
------------------------------------------------------------------------------
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
------------------------------------------------------------------------------

==============================================================================
C++, C          | 511.povray_r(base, peak) 526.blender_r(base, peak)
------------------------------------------------------------------------------
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
------------------------------------------------------------------------------

==============================================================================
C++, C, Fortran | 507.cactuBSSN_r(base, peak)
------------------------------------------------------------------------------
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
------------------------------------------------------------------------------

==============================================================================
Fortran         | 503.bwaves_r(base, peak) 549.fotonik3d_r(base, peak)
                | 554.roms_r(base, peak)
------------------------------------------------------------------------------
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
------------------------------------------------------------------------------

==============================================================================
Fortran, C      | 521.wrf_r(base, peak) 527.cam4_r(base, peak)
------------------------------------------------------------------------------
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
AOCC.LLVM.2.0.0.B191.2019_07_19 clang version 8.0.0 (CLANG: Jenkins
  AOCC_2_0_0-Build#191) (based on LLVM AOCC.LLVM.2.0.0.B191.2019_07_19)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/aocc-compiler-2.0.0/bin
------------------------------------------------------------------------------

Base Compiler Invocation

C benchmarks:

 clang 

C++ benchmarks:

 clang++ 

Fortran benchmarks:

 flang 

Benchmarks using both Fortran and C:

 flang   clang 

Benchmarks using both C and C++:

 clang++   clang 

Benchmarks using Fortran, C, and C++:

 clang++   clang   flang 

Base Portability Flags

503.bwaves_r:  -DSPEC_LP64 
507.cactuBSSN_r:  -DSPEC_LP64 
508.namd_r:  -DSPEC_LP64 
510.parest_r:  -DSPEC_LP64 
511.povray_r:  -DSPEC_LP64 
519.lbm_r:  -DSPEC_LP64 
521.wrf_r:  -DSPEC_CASE_FLAG   -Mbyteswapio   -DSPEC_LP64 
526.blender_r:  -funsigned-char   -D__BOOL_DEFINED   -DSPEC_LP64 
527.cam4_r:  -DSPEC_CASE_FLAG   -DSPEC_LP64 
538.imagick_r:  -DSPEC_LP64 
544.nab_r:  -DSPEC_LP64 
549.fotonik3d_r:  -DSPEC_LP64 
554.roms_r:  -DSPEC_LP64 

Base Optimization Flags

C benchmarks:

 -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -O3   -ffast-math   -march=znver2   -fstruct-layout=3   -mllvm -unroll-threshold=50   -fremap-arrays   -mllvm -function-specialize   -mllvm -enable-gvn-hoist   -mllvm -reduce-array-computations=3   -mllvm -global-vectorize-slp   -mllvm -vector-library=LIBMVEC   -mllvm -inline-threshold=1000   -flv-function-specialization   -z muldefs   -lmvec   -lamdlibm   -ljemalloc   -lflang 

C++ benchmarks:

 -std=c++98   -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-mllvm -Wl,-suppress-fmas   -O3   -ffast-math   -march=znver2   -mllvm -loop-unswitch-threshold=200000   -mllvm -vector-library=LIBMVEC   -mllvm -unroll-threshold=100   -flv-function-specialization   -mllvm -enable-partial-unswitch   -z muldefs   -lmvec   -lamdlibm   -ljemalloc   -lflang 

Fortran benchmarks:

 -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -O3   -march=znver2   -funroll-loops   -Mrecursive   -mllvm -vector-library=LIBMVEC   -z muldefs   -Kieee   -fno-finite-math-only   -lmvec   -lamdlibm   -ljemalloc   -lflang 

Benchmarks using both Fortran and C:

 -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -O3   -ffast-math   -march=znver2   -fstruct-layout=3   -mllvm -unroll-threshold=50   -fremap-arrays   -mllvm -function-specialize   -mllvm -enable-gvn-hoist   -mllvm -reduce-array-computations=3   -mllvm -global-vectorize-slp   -mllvm -vector-library=LIBMVEC   -mllvm -inline-threshold=1000   -flv-function-specialization   -funroll-loops   -Mrecursive   -z muldefs   -Kieee   -fno-finite-math-only   -lmvec   -lamdlibm   -ljemalloc   -lflang 

Benchmarks using both C and C++:

 -std=c++98   -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-mllvm -Wl,-suppress-fmas   -O3   -ffast-math   -march=znver2   -fstruct-layout=3   -mllvm -unroll-threshold=50   -fremap-arrays   -mllvm -function-specialize   -mllvm -enable-gvn-hoist   -mllvm -reduce-array-computations=3   -mllvm -global-vectorize-slp   -mllvm -vector-library=LIBMVEC   -mllvm -inline-threshold=1000   -flv-function-specialization   -mllvm -loop-unswitch-threshold=200000   -mllvm -unroll-threshold=100   -mllvm -enable-partial-unswitch   -z muldefs   -lmvec   -lamdlibm   -ljemalloc   -lflang 

Benchmarks using Fortran, C, and C++:

 -std=c++98   -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-mllvm -Wl,-suppress-fmas   -O3   -ffast-math   -march=znver2   -fstruct-layout=3   -mllvm -unroll-threshold=50   -fremap-arrays   -mllvm -function-specialize   -mllvm -enable-gvn-hoist   -mllvm -reduce-array-computations=3   -mllvm -global-vectorize-slp   -mllvm -vector-library=LIBMVEC   -mllvm -inline-threshold=1000   -flv-function-specialization   -mllvm -loop-unswitch-threshold=200000   -mllvm -unroll-threshold=100   -mllvm -enable-partial-unswitch   -funroll-loops   -Mrecursive   -z muldefs   -Kieee   -fno-finite-math-only   -lmvec   -lamdlibm   -ljemalloc   -lflang 
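
The flag lines above mix three kinds of options: `-Wl,-mllvm -Wl,<opt>` pairs, which hand an LLVM option to the linker for the link-time optimization step; `-mllvm <opt>` pairs, which apply at compile time; and `-l<name>` libraries. The sketch below sorts a flag line into those groups. It is only a reading aid with illustrative names, not part of the SPEC tooling.

```python
import shlex


def classify_flags(line):
    """Group a flag line into link-time LLVM, compile-time LLVM, libraries, other."""
    tokens = shlex.split(line)
    groups = {"link_llvm": [], "compile_llvm": [], "libraries": [], "other": []}
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        has_next = i + 1 < len(tokens)
        if tok == "-Wl,-mllvm" and has_next and tokens[i + 1].startswith("-Wl,"):
            groups["link_llvm"].append(tokens[i + 1][len("-Wl,"):])
            i += 2
        elif tok == "-mllvm" and has_next:
            groups["compile_llvm"].append(tokens[i + 1])
            i += 2
        elif tok.startswith("-l"):
            groups["libraries"].append(tok[2:])
            i += 1
        else:
            groups["other"].append(tok)
            i += 1
    return groups


if __name__ == "__main__":
    # Base Fortran flags as listed above.
    fortran = (
        "-flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-region-vectorize "
        "-Wl,-mllvm -Wl,-vector-library=LIBMVEC "
        "-Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -march=znver2 "
        "-funroll-loops -Mrecursive -mllvm -vector-library=LIBMVEC -z muldefs "
        "-Kieee -fno-finite-math-only -lmvec -lamdlibm -ljemalloc -lflang"
    )
    for group, flags in classify_flags(fortran).items():
        print(group, flags)
```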

Peak Compiler Invocation

C benchmarks:

 clang 

C++ benchmarks:

 clang++ 

Fortran benchmarks:

 flang 

Benchmarks using both Fortran and C:

 flang   clang 

Benchmarks using both C and C++:

 clang++   clang 

Benchmarks using Fortran, C, and C++:

 clang++   clang   flang 

Peak Portability Flags

Same as Base Portability Flags

Peak Optimization Flags

C benchmarks:

 -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Ofast   -march=znver2   -mno-sse4a   -fstruct-layout=5   -mllvm -vectorize-memory-aggressively   -mllvm -function-specialize   -mllvm -enable-gvn-hoist   -mllvm -unroll-threshold=50   -fremap-arrays   -mllvm -vector-library=LIBMVEC   -mllvm -reduce-array-computations=3   -mllvm -global-vectorize-slp   -mllvm -inline-threshold=1000   -flv-function-specialization   -lmvec   -lamdlibm   -ljemalloc   -lflang 

C++ benchmarks:

508.namd_r:  -std=c++98   -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Ofast   -march=znver2   -flv-function-specialization   -mllvm -unroll-threshold=100   -mllvm -enable-partial-unswitch   -mllvm -loop-unswitch-threshold=200000   -mllvm -vector-library=LIBMVEC   -mllvm -inline-threshold=1000   -lmvec   -lamdlibm   -ljemalloc   -lflang 
510.parest_r:  -std=c++98   -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-mllvm -Wl,-suppress-fmas   -Ofast   -march=znver2   -flv-function-specialization   -mllvm -unroll-threshold=100   -mllvm -enable-partial-unswitch   -mllvm -loop-unswitch-threshold=200000   -mllvm -vector-library=LIBMVEC   -mllvm -inline-threshold=1000   -lmvec   -lamdlibm   -ljemalloc   -lflang 

Fortran benchmarks:

503.bwaves_r:  -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -O3   -march=znver2   -funroll-loops   -Mrecursive   -mllvm -vector-library=LIBMVEC   -Kieee   -fno-finite-math-only   -lmvec   -lamdlibm   -ljemalloc   -lflang 
549.fotonik3d_r:  Same as 503.bwaves_r 
554.roms_r:  -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-mllvm -Wl,-enable-X86-prefetching   -O3   -march=znver2   -funroll-loops   -Mrecursive   -mllvm -vector-library=LIBMVEC   -Kieee   -fno-finite-math-only   -lmvec   -lamdlibm   -ljemalloc   -lflang 

Benchmarks using both Fortran and C:

 -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Ofast   -march=znver2   -mno-sse4a   -fstruct-layout=5   -mllvm -vectorize-memory-aggressively   -mllvm -function-specialize   -mllvm -enable-gvn-hoist   -mllvm -unroll-threshold=50   -fremap-arrays   -mllvm -vector-library=LIBMVEC   -mllvm -reduce-array-computations=3   -mllvm -global-vectorize-slp   -mllvm -inline-threshold=1000   -flv-function-specialization   -O3   -funroll-loops   -Mrecursive   -Kieee   -fno-finite-math-only   -lmvec   -lamdlibm   -ljemalloc   -lflang 

Benchmarks using both C and C++:

511.povray_r:  -std=c++98   -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Wl,-mllvm -Wl,-x86-use-vzeroupper=false   -Ofast   -march=znver2   -mno-sse4a   -fstruct-layout=5   -mllvm -vectorize-memory-aggressively   -mllvm -function-specialize   -mllvm -enable-gvn-hoist   -mllvm -unroll-threshold=50   -fremap-arrays   -mllvm -vector-library=LIBMVEC   -mllvm -reduce-array-computations=3   -mllvm -global-vectorize-slp   -mllvm -inline-threshold=1000   -flv-function-specialization   -mllvm -unroll-threshold=100   -mllvm -enable-partial-unswitch   -mllvm -loop-unswitch-threshold=200000   -lmvec   -lamdlibm   -ljemalloc   -lflang 
526.blender_r:  -std=c++98   -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Ofast   -march=znver2   -mno-sse4a   -fstruct-layout=5   -mllvm -vectorize-memory-aggressively   -mllvm -function-specialize   -mllvm -enable-gvn-hoist   -mllvm -unroll-threshold=50   -fremap-arrays   -mllvm -vector-library=LIBMVEC   -mllvm -reduce-array-computations=3   -mllvm -global-vectorize-slp   -mllvm -inline-threshold=1000   -flv-function-specialization   -mllvm -unroll-threshold=100   -mllvm -enable-partial-unswitch   -mllvm -loop-unswitch-threshold=200000   -lmvec   -lamdlibm   -ljemalloc   -lflang 

Benchmarks using Fortran, C, and C++:

 -std=c++98   -flto   -Wl,-mllvm -Wl,-function-specialize   -Wl,-mllvm -Wl,-region-vectorize   -Wl,-mllvm -Wl,-vector-library=LIBMVEC   -Wl,-mllvm -Wl,-reduce-array-computations=3   -Ofast   -march=znver2   -mno-sse4a   -fstruct-layout=5   -mllvm -vectorize-memory-aggressively   -mllvm -function-specialize   -mllvm -enable-gvn-hoist   -mllvm -unroll-threshold=50   -fremap-arrays   -mllvm -vector-library=LIBMVEC   -mllvm -reduce-array-computations=3   -mllvm -global-vectorize-slp   -mllvm -inline-threshold=1000   -flv-function-specialization   -mllvm -unroll-threshold=100   -mllvm -enable-partial-unswitch   -mllvm -loop-unswitch-threshold=200000   -O3   -funroll-loops   -Mrecursive   -Kieee   -fno-finite-math-only   -lmvec   -lamdlibm   -ljemalloc   -lflang 
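
To see what peak tuning changes relative to base, the two flag lines for a language group can be compared unit by unit. The sketch below does this for the C benchmarks, treating `-mllvm X`, `-Wl,-mllvm -Wl,X` and `-z X` as single units. The function names are illustrative; the flag strings are copied from the listings in this report.

```python
import shlex

# Options that take the following token as their argument.
PAIR_PREFIXES = ("-mllvm", "-Wl,-mllvm", "-z")

BASE_C = (
    "-flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-region-vectorize "
    "-Wl,-mllvm -Wl,-vector-library=LIBMVEC "
    "-Wl,-mllvm -Wl,-reduce-array-computations=3 "
    "-O3 -ffast-math -march=znver2 -fstruct-layout=3 -mllvm -unroll-threshold=50 "
    "-fremap-arrays -mllvm -function-specialize -mllvm -enable-gvn-hoist "
    "-mllvm -reduce-array-computations=3 -mllvm -global-vectorize-slp "
    "-mllvm -vector-library=LIBMVEC -mllvm -inline-threshold=1000 "
    "-flv-function-specialization -z muldefs -lmvec -lamdlibm -ljemalloc -lflang"
)

PEAK_C = (
    "-flto -Wl,-mllvm -Wl,-function-specialize -Wl,-mllvm -Wl,-region-vectorize "
    "-Wl,-mllvm -Wl,-vector-library=LIBMVEC "
    "-Wl,-mllvm -Wl,-reduce-array-computations=3 "
    "-Ofast -march=znver2 -mno-sse4a -fstruct-layout=5 "
    "-mllvm -vectorize-memory-aggressively -mllvm -function-specialize "
    "-mllvm -enable-gvn-hoist -mllvm -unroll-threshold=50 -fremap-arrays "
    "-mllvm -vector-library=LIBMVEC -mllvm -reduce-array-computations=3 "
    "-mllvm -global-vectorize-slp -mllvm -inline-threshold=1000 "
    "-flv-function-specialization -lmvec -lamdlibm -ljemalloc -lflang"
)


def flag_units(line):
    """Split a flag line into units, keeping option/argument pairs together."""
    tokens = shlex.split(line)
    units, i = [], 0
    while i < len(tokens):
        if tokens[i] in PAIR_PREFIXES and i + 1 < len(tokens):
            units.append(tokens[i] + " " + tokens[i + 1])
            i += 2
        else:
            units.append(tokens[i])
            i += 1
    return units


def flag_diff(base_line, peak_line):
    """Return (units only in base, units only in peak), preserving order."""
    base, peak = flag_units(base_line), flag_units(peak_line)
    only_base = [u for u in base if u not in peak]
    only_peak = [u for u in peak if u not in base]
    return only_base, only_peak


if __name__ == "__main__":
    only_base, only_peak = flag_diff(BASE_C, PEAK_C)
    print("base only:", only_base)
    print("peak only:", only_peak)
```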

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2017/flags/aocc200-flags-C4.2021-12-01.html,
http://www.spec.org/cpu2017/flags/Tyrone-Platform-Settings-V1.2-Milan-revB.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2017/flags/aocc200-flags-C4.2021-12-01.xml,
http://www.spec.org/cpu2017/flags/Tyrone-Platform-Settings-V1.2-Milan-revB.xml.