# Invocation command line: # /home/cpu2017/bin/harness/runcpu --configfile amd_speed_aocc320_milanx_A1.cfg --tune all --reportable --iterations 3 --nopower --runmode speed --tune base:peak --size test:train:refspeed intspeed # output_root was not used for this run ############################################################################ ################################################################################ # AMD AOCC 320 SPEC CPU2017 V1.1.8 Speed Configuration File for 64-bit Linux # # File name : amd_speed_aocc320_milanx_A1.cfg # Creation Date : February 11, 2022 # CPU2017 Version : 1.1.8 # Supported benchmarks : All Speed benchmarks (intspeed, fpspeed) # Compiler name/version : AOCC 3.2.0 # Operating system version : OpenSUSE 15.2 # Supported OS's : Ubuntu 20.04, RHEL 8.3, SLES 15 SP2 # Hardware : AMD Milan, Rome, (AMD64) # FP Base Pointer Size : 64-bit # FP Peak Pointer Size : 64-bit # INT Base Pointer Size : 64-bit # INT Peak Pointer Size : 64-bit # Auto Parallization : No # # Note: DO NOT EDIT THIS FILE, the only edits required to properly run these # binaries are made in the ini Python file. Please consult Readme.amd_speed_aocc320_milanx_A1.txt # for a few uncommon exceptions which require edits to this file. # # Description: # # This binary package automates away many of the complexities necessary to set # up and run SPEC CPU2017 under optimized conditions on AMD Milan-X/Milan/Rome-based # server platforms within Linux (AMD64). # # The binary package was built specifically for AMD Milan-X/Milan/Rome microprocessors and # is not intended to run on other products. # # Please install the binary package by following the instructions in # "Readme.amd_speed_aocc320_milanx_A1.txt" under the "How To Use the Binaries" section. # # The binary package is designed to work without alteration on two socket AMD # Milan-X/Milan/Rome-based servers with 64 cores per socket, SMT enabled and 1 TiB of DDR4 # memory distributed evenly among all 16 channels using 32 GiB DIMMs. # # To run the binary package on other Milan-X/Milan/Rome configurations, please review # "Readme.amd_speed_aocc320_milanx_A1.txt". In general, Milan-X/Milan/Rome CPUs # should be autodetected with no action required by the user. # # In most cases, it should be unnecessary to edit "amd_speed_aocc320_milanx_A1.cfg" or any # other file besides "ini_amd_speed_aocc320_milanx_A1.py" where reporting fields # and run conditions are set. # # The run script automatically sets the optimal number of speed copies and binds # them appropriately. # # The run script and accompanying binary package are designed to work on Ubuntu # 20.04, RHEL 8.3 and SLES 15 SP2. # # Important! If you write your own run script, please set the stack size to # "unlimited" when executing this binary package. Failure to do so may cause # some benchmarks to overflow the stack. For example, to set stack size within # the bash shell, include the following line somewhere at the top of your run # script before the runcpu invocation: # # ulimit -s unlimited # # Modification of this config file should only be necessary if you intend to # rebuild the binaries. General instructions for rebuilding the binaries are # found in-line below. # ################################################################################ # Modifiable macros: ################################################################################ # Change the following line to true if you intend to REBUILD the binaries (AMD # does not support this). Valid values are "true" or "false" (no quotes). %define allow_build false # Only change these macros if you are rebuilding the binary package: %define compiler_name aocc320 %define binary_package_name amd_speed_%{compiler_name}_milanx_A %define binary_package_revision 1 # build_path cannot contain SPEC variables or it will trigger rebuilds: %define build_path /sppo/bin/cpu2017v118-aocc32-milanx-speed %define flags_file_name %{compiler_name}-flags-A1.xml # To enable the platform file, be sure to uncomment the flagsurl02 header line # below. %define platform_file_name INVALID_platform_%{binary_package_name}.xml # You should never have to change binary_package_full_name: %define binary_package_full_name %{binary_package_name}%{binary_package_revision} ################################################################################ ################################################################################ # Include file name ################################################################################ # The include file contains fields that are commonly changed. This file is auto- # generated based upon INI file settings and should not need user modification # for runs. %define inc_file_name %{binary_package_full_name}.inc ################################################################################ ################################################################################ # Binary label extension and "allow_build"" switch ################################################################################ # Only modify the binary label extension if you plan to rebuild the binaries. %define ext %{binary_package_name} # If you plan to recompile these CPU2017 binaries, please choose a new extension # name (ext above) to avoid confusion with the current binary set on your system # under test, and to avoid confusion for SPEC submission reviewers. You will # also need to set "allow_build" to true below. Finally, you must modify the # Paths section below to point to your library locations if the paths are not # already set up in your build environment. ################################################################################ ################################################################################ # Paths and Environment Variables # ** MODIFY AS NEEDED (modification should not be necessary for runs) ** ################################################################################ # Allow environment variables to be set before runs: preenv = 1 # Necessary to avoid gcc out-of-memory exceptions on certain SUTs: preENV_MALLOC_CONF = retain:true preENV_LIBOMP_NUM_HIDDEN_HELPER_THREADS = 0 # OpenMP environment variables: preENV_OMP_SCHEDULE = static preENV_OMP_DYNAMIC = false preENV_OMP_STACKSIZE = 128M # Define the name of the directory that holds AMD library files: %define lib_dir %{binary_package_name}_lib %define build_lib_dir %{binary_package_name}_lib # Set the shared object library path for runs and builds: preENV_LD_LIBRARY_PATH = $[top]/%{lib_dir}/lib;$[top]/%{lib_dir}/lib32:%{ENV_LD_LIBRARY_PATH} # Define 32-bit library build paths: # Do not use $[top] with the 32-bit libraries because doing so will cause an # options checksum error triggering a xalanc recompile attempt on SUTs having # different file paths. # NOTE: no 32-bit libraries are currently needed with Speed. JEMALLOC_LIB32_PATH = %{build_path}%{build_lib_dir}/lib32 %if '%{allow_build}' eq 'false' # The include file is only needed for runs, but not for builds. # include: %{inc_file_name} # ----- Begin inclusion of 'amd_speed_aocc320_milanx_A1.inc' ############################################################################ ################################################################################ ################################################################################ # File name: amd_speed_aocc320_milanx_A1.inc # File generation code date: May 4, 2021 # File generation date/time: October 07, 2022 / 08:40:09 # # This file is automatically generated during a SPEC CPU2017 run. # # To modify inc file generation, please consult the readme file or the run # script. ################################################################################ ################################################################################ ################################################################################ ################################################################################ # The following macros are generated for use in the cfg file. ################################################################################ ################################################################################ %define logical_core_count 128 %define physical_core_count 128 %define physical_core_max 127 %define logical_core_max 127 ################################################################################ ################################################################################ # The following inc blocks set the speed thread counts and affinity settings. # # intspeed benchmarks: 600.perlbench_s,602.gcc_s,605.mcf_s,620.omnetpp_s, # 623.xalancbmk_s,625.x264_s,631.deepsjeng_s,641.leela_s,648.exchange2_s, # 657.xz_s # fpspeed benchmarks: 603.bwaves_s,607.cactuBSSN_s,619.lbm_s,621.wrf_s, # 627.cam4_s,628.pop2_s,638.imagick_s,644.nab_s,649.fotonik3d_s, # 654.roms_s # # Selected thread counts from '64p' section of CPU info ################################################################################ # default preENV thread settings: default: preENV_OMP_THREAD_LIMIT = 128 preENV_GOMP_CPU_AFFINITY = 0-127 ################################################################################ ################################################################################ # intspeed base thread counts: intspeed=base: threads = 128 ENV_GOMP_CPU_AFFINITY = 0-127 bind0 = numactl --physcpubind=0-127 submit = echo "$command" > run.sh ; $BIND bash run.sh ################################################################################ ################################################################################ # fpspeed base thread counts: fpspeed=base: threads = 128 ENV_GOMP_CPU_AFFINITY = 0-127 bind0 = numactl --physcpubind=0-127 submit = echo "$command" > run.sh ; $BIND bash run.sh ################################################################################ ################################################################################ # peak thread counts: 1 600.perlbench_s,602.gcc_s,605.mcf_s,620.omnetpp_s,623.xalancbmk_s,625.x264_s,631.deepsjeng_s,641.leela_s,648.exchange2_s=peak: threads = 1 ENV_GOMP_CPU_AFFINITY = 0 bind0 = numactl --physcpubind=0 submit = echo "$command" > run.sh ; $BIND bash run.sh ################################################################################ ################################################################################ # peak thread counts: 128 603.bwaves_s,607.cactuBSSN_s,619.lbm_s,621.wrf_s,627.cam4_s,628.pop2_s,638.imagick_s,649.fotonik3d_s,654.roms_s,657.xz_s=peak: threads = 128 ENV_GOMP_CPU_AFFINITY = 0-127 bind0 = numactl --physcpubind=0-127 submit = echo "$command" > run.sh ; $BIND bash run.sh ################################################################################ ################################################################################ # peak thread counts: 128 644.nab_s=peak: threads = 128 ENV_GOMP_CPU_AFFINITY = 0-127 bind0 = numactl --physcpubind=0-127 submit = echo "$command" > run.sh ; $BIND bash run.sh ################################################################################ ################################################################################ ################################################################################ # Switch back to default: default: ################################################################################ ################################################################################ ################################################################################ # The remainder of this file defines CPU2017 report parameters. ################################################################################ ################################################################################ ################################################################################ # SPEC CPU 2017 report header ################################################################################ license_num =9019 tester =Cisco Systems test_sponsor =Cisco Systems hw_vendor =Cisco Systems hw_model000 =Cisco UCS C225 M6 (AMD EPYC 7662) #--------- If you install new compilers, edit this section -------------------- sw_compiler =C/C++/Fortran: Version 3.2.0 of AOCC ################################################################################ ################################################################################ # Hardware, firmware and software information ################################################################################ hw_avail =Aug-2021 sw_avail =Dec-2021 hw_cpu_name =AMD EPYC 7662 hw_cpu_nominal_mhz =2000 hw_cpu_max_mhz =3300 hw_ncores =128 hw_nthreadspercore =1 hw_ncpuorder =1,2 chips hw_other =None # Other perf-relevant hw, or "None" fw_bios =Version 4.2.2b released May-2022 sw_base_ptrsize =64-bit hw_pcache =32 KB I + 32 KB D on chip per core hw_scache =512 KB I+D on chip per core hw_tcache000 =256 MB I+D on chip per chip, 16 MB shared / 4 hw_tcache001 = cores hw_ocache =None ################################################################################ # Notes ################################################################################ # Enter notes_000 through notes_100 here. notes_000 =Binaries were compiled on a system with 2x AMD EPYC 7742 CPU + 1TiB Memory using openSUSE 15.2 notes_005 = notes_010 =NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown) notes_015 =is mitigated in the system as tested and documented. notes_020 =Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1) notes_025 =is mitigated in the system as tested and documented. notes_030 =Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2) notes_035 =is mitigated in the system as tested and documented. notes_040 = notes_submit_000 ='numactl' was used to bind copies to the cores. notes_submit_005 =See the configuration file for details. notes_os_000 ='ulimit -s unlimited' was used to set environment stack size limit notes_os_005 ='ulimit -l 2097152' was used to set environment locked pages in memory limit notes_os_010 = notes_os_015 =runcpu command invoked through numactl i.e.: notes_os_020 =numactl --interleave=all runcpu notes_os_025 = notes_os_030 =To limit dirty cache to 8% of memory, 'sysctl -w vm.dirty_ratio=8' run as root. notes_os_035 =To limit swap usage to minimum necessary, 'sysctl -w vm.swappiness=1' run as root. notes_os_040 =To free node-local memory and avoid remote memory usage, notes_os_045 ='sysctl -w vm.zone_reclaim_mode=1' run as root. notes_os_050 =To clear filesystem caches, 'sync; sysctl -w vm.drop_caches=3' run as root. notes_os_055 =To disable address space layout randomization (ASLR) to reduce run-to-run notes_os_060 =variability, 'sysctl -w kernel.randomize_va_space=0' run as root. notes_os_065 = notes_os_thp_000 =To enable Transparent Hugepages (THP) for all allocations, notes_os_thp_005 ='echo always > /sys/kernel/mm/transparent_hugepage/enabled' and notes_os_thp_010 ='echo always > /sys/kernel/mm/transparent_hugepage/defrag' run as root. notes_comp_000 =The AMD64 AOCC Compiler Suite is available at notes_comp_005 =http://developer.amd.com/amd-aocc/ notes_comp_010 = notes_jemalloc_000 =jemalloc: configured and built with GCC v4.8.2 in RHEL 7.4 (No options specified) notes_jemalloc_005 =jemalloc 5.1.0 is available here: notes_jemalloc_010 =https://github.com/jemalloc/jemalloc/releases/download/5.1.0/jemalloc-5.1.0.tar.bz2 notes_jemalloc_015 = sw_other =jemalloc: jemalloc memory allocator library v5.1.0 ################################################################################ # The following note fields describe platorm settings. ################################################################################ # example: (uncomment as necessary) # notes_plat_000 =BIOS settings: # notes_plat_002 = cTDP: 280 # notes_plat_004 = Determinism Slider set to Power # notes_plat_006 = Package Power: 280 # notes_plat_008 = EDC: 300 # notes_plat_010 = NPS: 1 # notes_plat_014 = 4-link xGMI max speed: 16Gbps # notes_plat_015 = Fan Speed: Maximum ################################################################################ # The following are custom fields: ################################################################################ # Use custom_fields to enter lines that are not listed here. For example: # notes_plat_100 = Energy Bias set to Max Performance # new_field = Ambient temperature set to 10C ################################################################################ # The following fields must be set here for only Int benchmarks. ################################################################################ intspeed: sw_peak_ptrsize =64-bit notes_os_thp_015 = ################################################################################ # The following fields must be set here for FP benchmarks. ################################################################################ fpspeed: sw_peak_ptrsize =64-bit notes_os_thp_003 =To enable THP only on request for peak runs of 628.pop2_s: notes_os_thp_004 ='echo madvise > /sys/kernel/mm/transparent_hugepage/enabled' run as root. notes_os_thp_005 =To disable THP for peak runs of 627.cam4_s, 649.fotonik3d_s, and 654.roms_s, notes_os_thp_006 ='echo never > /sys/kernel/mm/transparent_hugepage/enabled' run as root. notes_os_thp_007 = ################################################################################ # The following fields must be set here or they will be overwritten by sysinfo. ################################################################################ intspeed,fpspeed: hw_disk =1 x 960 GB M.2 SSD SATA hw_memory000 =2 TB (16 x 128 GB 4Rx4 PC4-3200AA-L) hw_memory002 = hw_nchips =2 prepared_by =Cisco Systems sw_file =xfs sw_os000 =SUSE Linux Enterprise Server 15 SP2 (x86_64) sw_os001 =kernel version # ex: Kernel 4.4.0-87-generic sw_state =Run level 3 (multi-user) ################################################################################ # End of inc file ################################################################################ # Switch back to the default block after the include file: default: # ---- End inclusion of '/home/cpu2017/config/amd_speed_aocc320_milanx_A1.inc' # Switch back to default block after the include file: default: fail_build = 1 %elif '%{allow_build}' eq 'true' # If you intend to rebuild, be sure to set the library paths either in the # build script or here: preENV_LIBRARY_PATH = $[top]/%{build_lib_dir}/lib;$[top]/%{build_lib_dir}/lib32:%{ENV_LIBRARY_PATH} % define build_ncpus 64 # controls number of simultaneous compiles fail_build = 0 makeflags = --jobs=%{build_ncpus} --load-average=%{build_ncpus} %else % error The value of "allow_build" is %{allow_build}, but it can only be "true" or "false". This error was generated %endif ################################################################################ ################################################################################ # Enable automated data collection per benchmark ################################################################################ # Data collection is not enabled for reportable runs. # teeout is necessary to get data collection stdout into the logs. Best # practices for the individual data collection items would be to have # them store important output in separate files. Filenames could be # constructed from $SPEC (environment), $lognum (result number from runcpu), # and benchmark name/number. teeout = yes # Run runcpu with '-v 35' (or greater) to log lists of variables which can # be used in substitutions as below. # For CPU2006, change $label to $ext %define data-collection-parameters benchname='$name' benchnum='$num' benchmark='$benchmark' iteration=$iter size='$size' tune='$tune' label='$label' log='$log' lognum='$lognum' from_runcpu='$from_runcpu' %define data-collection-start $[top]/data-collection/data-collection start %{data-collection-parameters} %define data-collection-stop $[top]/data-collection/data-collection stop %{data-collection-parameters} monitor_specrun_wrapper = %{data-collection-start} ; $command ; %{data-collection-stop} ################################################################################ ################################################################################ # Header settings ################################################################################ backup_config = 0 # set to 0 if you do not want backup files bench_post_setup = sync # command_add_redirect: If set, the generated ${command} will include # redirection operators (stdout, stderr), which are passed along to the shell # that executes the command. If this variable is not set, specinvoke does the # redirection. command_add_redirect = yes env_vars = yes flagsurl000 = http://www.spec.org/cpu2017/flags/aocc320-flags-A1.xml flagsurl001 = http://www.spec.org/cpu2017/flags/Cisco-Platform-Settings-AMD-v2-revD.xml #flagsurl02 = $[top]/%{platform_file_name} # label: User defined extension string that tags your binaries & directories: label = %{ext} line_width = 1020 log_line_width = 1020 mean_anyway = yes output_format = all reportable = yes size = test,train,ref teeout = yes teerunout = yes tune = base,peak use_submit_for_speed = yes ################################################################################ ################################################################################ # Compilers ################################################################################ default: CC = clang -m64 CXX = clang++ -m64 FC = flang -m64 CLD = clang -m64 CXXLD = clang++ -m64 FLD = flang -m64 CC_VERSION_OPTION = --version CXX_VERSION_OPTION = --version FC_VERSION_OPTION = --version ################################################################################ ################################################################################ # Default Flags ################################################################################ default: # SPEC CPU flags: EXTRA_LIBS = -fopenmp=libomp -lomp -ljemalloc -lamdlibm -lm MATHLIBOPT = #clearing this variable or else SPEC will set it to -lm VECMATHLIB = -fveclib=AMDLIBM # AOCC option variables: OPT_ROOT = -march=znver3 $(VECMATHLIB) -ffast-math -fopenmp OPT_ROOT_BASE = -O3 $(OPT_ROOT) OPT_ROOT_PEAK = -Ofast $(OPT_ROOT) -flto ################################################################################ ################################################################################ # Portability Flags ################################################################################ default: # data model applies to all benchmarks: EXTRA_PORTABILITY = -DSPEC_LP64 # *** Benchmark-specific portability *** # Anything other than the data model is only allowed where a need is proven. # (ordered by last 2 digits of benchmark number) 600.perlbench_s: #lang='C' PORTABILITY = -DSPEC_LINUX_X64 621.wrf_s: #lang='F,C' CPORTABILITY = -DSPEC_CASE_FLAG FPORTABILITY = -Mbyteswapio 623.xalancbmk_s: #lang='CXX' PORTABILITY = -DSPEC_LINUX 627.cam4_s: #lang='F,C' PORTABILITY = -DSPEC_CASE_FLAG 628.pop2_s: #lang='F,C' CPORTABILITY = -DSPEC_CASE_FLAG FPORTABILITY = -Mbyteswapio ################################################################################ ################################################################################ # Tuning Flags ################################################################################ ##################### # Base tuning flags # ##################### default=base: COPTIMIZE = $(OPT_ROOT_BASE) -flto -fstruct-layout=5 \ -mllvm -unroll-threshold=50 \ -mllvm -inline-threshold=1000 -fremap-arrays \ -mllvm -function-specialize -flv-function-specialization \ -mllvm -enable-gvn-hoist \ -mllvm -global-vectorize-slp=true \ -mllvm -enable-licm-vrp \ -mllvm -reduce-array-computations=3 \ -Wno-unused-command-line-argument CXXOPTIMIZE = $(OPT_ROOT_BASE) -flto \ -mllvm -enable-partial-unswitch \ -mllvm -unroll-threshold=100 \ -finline-aggressive -flv-function-specialization \ -mllvm -loop-unswitch-threshold=200000 \ -mllvm -reroll-loops \ -mllvm -aggressive-loop-unswitch \ -mllvm -extra-vectorizer-passes \ -mllvm -reduce-array-computations=3 \ -mllvm -global-vectorize-slp=true \ -Wno-unused-command-line-argument \ -mllvm -convert-pow-exp-to-int=false FOPTIMIZE = -Hz,1,0x1 $(OPT_ROOT_BASE) -Mrecursive \ -mllvm -fuse-tile-inner-loop -funroll-loops \ -mllvm -extra-vectorizer-passes \ -mllvm -lsr-in-nested-loop \ -mllvm -enable-licm-vrp \ -mllvm -reduce-array-computations=3 \ -mllvm -global-vectorize-slp=true \ -Wno-unused-command-line-argument \ -mllvm -enable-loopinterchange \ -mllvm -compute-interchange-order LDCXXFLAGS = -Wl,-mllvm -Wl,-x86-use-vzeroupper=false EXTRA_LDFLAGS = -Wl,-mllvm -Wl,-region-vectorize \ -Wl,-mllvm -Wl,-function-specialize \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 LDFFLAGS = -Wl,-mllvm -Wl,-enable-X86-prefetching \ -Wl,-mllvm -Wl,-enable-licm-vrp #other libraries # Put OpenMP and math libraries here: # -lm needed at the end for some transcendental functions: EXTRA_LIBS = -fopenmp=libomp -lomp -lamdlibm -ljemalloc -lflang -lm EXTRA_FLIBS = # Don't put the AMD and mvec math libraries in MATHLIBOPT because it will trigger a reporting issue # because GCC won't use them. Forcefeed all benchmarks the math libraries in EXTRA_LIBS and clear # out MATHLIBOPT. MATHLIBOPT = # The following is necessary for 502/602 gcc: LDOPTIMIZE = -z muldefs EXTRA_OPTIMIZE = -DSPEC_OPENMP -fopenmp -Wno-return-type ################################################################################ ######################## # intspeed tuning flags # ######################## intspeed: FOPTIMIZE = $(OPT_ROOT_BASE) -flto EXTRA_FFLAGS = -mllvm -unroll-aggressive \ -mllvm -unroll-threshold=150 EXTRA_CXXFLAGS = -fvirtual-function-elimination -fvisibility=hidden LDCFLAGS = -Wl,-allow-multiple-definition -Wl,-mllvm \ -Wl,-enable-licm-vrp # LDCXXFLAGS is left empty as intspeed CPP bmks have to use VZEROUPPER # instruction which is the default. LDCXXFLAGS = LDFFLAGS = -Wl,-mllvm -Wl,-inline-recursion=4 \ -Wl,-mllvm -Wl,-lsr-in-nested-loop \ -Wl,-mllvm -Wl,-enable-iv-split # Setting submit in base and peak to identical values must be done due to a # quirk in the SPEC CPU harness. Do not try to move this assignment to intspeed # or default. intspeed=base: submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command intspeed=peak: submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ######################## # fpspeed tuning flags # ######################## # Setting submit in base and peak to identical values must be done due to a # quirk in the SPEC CPU harness. Do not try to move this assignment to fpspeed # or default. fpspeed=base: submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command fpspeed=peak: submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ##################### # Peak tuning flags # ##################### default=peak: COPTIMIZE = $(OPT_ROOT_PEAK) -fstruct-layout=5 \ -mllvm -unroll-threshold=50 -fremap-arrays \ -flv-function-specialization -mllvm \ -inline-threshold=1000 -mllvm -enable-gvn-hoist \ -mllvm -global-vectorize-slp=true -mllvm \ -function-specialize -mllvm -enable-licm-vrp \ -mllvm -reduce-array-computations=3 \ -Wno-unused-command-line-argument CXXOPTIMIZE = $(OPT_ROOT_PEAK) -finline-aggressive \ -mllvm -unroll-threshold=100 \ -flv-function-specialization -mllvm -enable-licm-vrp \ -mllvm -reroll-loops -mllvm \ -aggressive-loop-unswitch -mllvm \ -reduce-array-computations=3 -mllvm \ -global-vectorize-slp=true \ -Wno-unused-command-line-argument FOPTIMIZE = $(OPT_ROOT_PEAK) -Mrecursive \ -mllvm -reduce-array-computations=3 \ -mllvm -global-vectorize-slp=true \ -mllvm -enable-licm-vrp \ -Wno-unused-command-line-argument EXTRA_CXXFLAGS += -mllvm -do-block-reorder=aggressive EXTRA_LDFLAGS = -Wl,-mllvm -Wl,-function-specialize \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 LDFFLAGS = -Wl,-mllvm -Wl,-enable-X86-prefetching \ -Wl,-mllvm -Wl,-enable-licm-vrp LDCXXFLAGS = -Wl,-mllvm -Wl,-x86-use-vzeroupper=false \ -Wl,-mllvm -Wl,-enable-licm-vrp \ -Wl,-mllvm -Wl,-do-block-reorder=aggressive EXTRA_LIBS = -fopenmp=libomp -lomp -lamdlibm -ljemalloc -lflang -lm EXTRA_OPTIMIZE = -DSPEC_OPENMP -fopenmp -Wno-return-type feedback = 0 PASS1_CFLAGS = -fprofile-instr-generate PASS2_CFLAGS = -fprofile-instr-use PASS1_FFLAGS = -fprofile-generate PASS2_FFLAGS = -fprofile-use PASS1_CXXFLAGS = -fprofile-instr-generate PASS2_CXXFLAGS = -fprofile-instr-use PASS1_LDFLAGS = -fprofile-instr-generate PASS2_LDFLAGS = -fprofile-instr-use fdo_run1 = $command ; llvm-profdata merge --output=default.profdata *.profraw ######################################### # Benchmark specific peak tuning flags: # ######################################### ################################################################################ 600.perlbench_s=peak: # C submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 602.gcc_s=peak: # C submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 603.bwaves_s=peak: # Fortran FOPTIMIZE = -Ofast $(OPT_ROOT) -Mrecursive \ -mllvm -reduce-array-computations=3 \ -mllvm -global-vectorize-slp=true \ -mllvm -enable-licm-vrp \ -Wno-unused-command-line-argument submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 605.mcf_s=peak: # C submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 607.cactuBSSN_s=peak: # C, C++, Fortran submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 620.omnetpp_s =peak: # C++ submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 621.wrf_s=peak: # Fortran, C FOPTIMIZE = -Hz,1,0x1 $(OPT_ROOT_PEAK) -Mrecursive \ -mllvm -fuse-tile-inner-loop -funroll-loops \ -mllvm -extra-vectorizer-passes \ -mllvm -lsr-in-nested-loop \ -mllvm -enable-licm-vrp \ -mllvm -reduce-array-computations=3 \ -mllvm -global-vectorize-slp=true \ -Wno-unused-command-line-argument \ -mllvm -enable-loopinterchange \ -mllvm -compute-interchange-order LDFFLAGS = -Wl,-mllvm -Wl,-enable-X86-prefetching \ -Wl,-mllvm -Wl,-enable-licm-vrp submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 623.xalancbmk_s=peak: # C++ EXTRA_CXXFLAGS = -mllvm -do-block-reorder=aggressive \ -fvirtual-function-elimination -fvisibility=hidden EXTRA_LDFLAGS = -Wl,-mllvm -Wl,-function-specialize \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 \ -Wl,-mllvm -Wl,-do-block-reorder=aggressive submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 625.x264_s=peak: # C COPTIMIZE = $(OPT_ROOT_PEAK) -fstruct-layout=5 \ -mllvm -unroll-threshold=50 \ -mllvm -inline-threshold=1000 -fremap-arrays \ -mllvm -function-specialize -flv-function-specialization \ -mllvm -enable-gvn-hoist \ -mllvm -global-vectorize-slp=true \ -mllvm -enable-licm-vrp \ -mllvm -reduce-array-computations=3 \ -Wno-unused-command-line-argument \ -mllvm -do-block-reorder=aggressive EXTRA_LDFLAGS = -Wl,-mllvm -Wl,-do-block-reorder=aggressive \ -Wl,-mllvm -Wl,-region-vectorize \ -Wl,-mllvm -Wl,-function-specialize \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 LDCFLAGS = -Wl,-allow-multiple-definition \ -Wl,-mllvm -Wl,-enable-licm-vrp submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 627.cam4_s=peak: # Fortran, C FOPTIMIZE = -Hz,1,0x1 $(OPT_ROOT_PEAK) -Mrecursive \ -mllvm -fuse-tile-inner-loop -funroll-loops \ -mllvm -extra-vectorizer-passes \ -mllvm -lsr-in-nested-loop \ -mllvm -enable-licm-vrp \ -mllvm -reduce-array-computations=3 \ -mllvm -global-vectorize-slp=true \ -Wno-unused-command-line-argument \ -mllvm -enable-loopinterchange \ -mllvm -compute-interchange-order LDFFLAGS = -Wl,-mllvm -Wl,-enable-X86-prefetching \ -Wl,-mllvm -Wl,-enable-licm-vrp submit = echo never > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 628.pop2_s=peak: # Fortran90, C FOPTIMIZE = $(OPT_ROOT) -Ofast -Mrecursive \ -mllvm -reduce-array-computations=3 \ -mllvm -global-vectorize-slp=true \ -mllvm -enable-licm-vrp \ -Wno-unused-command-line-argument submit = echo madvise > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 631.deepsjeng_s=peak: # C++ CXXOPTIMIZE = $(OPT_ROOT_BASE) -flto \ -mllvm -enable-partial-unswitch \ -mllvm -unroll-threshold=100 \ -finline-aggressive -flv-function-specialization \ -mllvm -loop-unswitch-threshold=200000 \ -mllvm -reroll-loops \ -mllvm -aggressive-loop-unswitch \ -mllvm -extra-vectorizer-passes \ -mllvm -reduce-array-computations=3 \ -mllvm -global-vectorize-slp=true \ -Wno-unused-command-line-argument \ -mllvm -convert-pow-exp-to-int=false EXTRA_CXXFLAGS = -mllvm -do-block-reorder=aggressive \ -fvirtual-function-elimination -fvisibility=hidden EXTRA_LDFLAGS = -Wl,-mllvm -Wl,-do-block-reorder=aggressive \ -Wl,-mllvm -Wl,-region-vectorize \ -Wl,-mllvm -Wl,-function-specialize \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 # LDCXXFLAGS is left empty as intspeed CPP bmks have to use VZEROUPPER # instruction which is the default. LDCXXFLAGS = submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 638.imagick_s=peak: # C COPTIMIZE = $(OPT_ROOT_PEAK) -fstruct-layout=5 \ -mllvm -unroll-threshold=50 \ -mllvm -inline-threshold=1000 -fremap-arrays \ -mllvm -function-specialize -flv-function-specialization \ -mllvm -enable-gvn-hoist \ -mllvm -global-vectorize-slp=true \ -mllvm -enable-licm-vrp \ -mllvm -reduce-array-computations=3 \ -Wno-unused-command-line-argument \ -mllvm -do-block-reorder=aggressive EXTRA_LDFLAGS = -Wl,-mllvm -Wl,-do-block-reorder=aggressive \ -Wl,-mllvm -Wl,-region-vectorize \ -Wl,-mllvm -Wl,-function-specialize \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 LDCFLAGS = -Wl,-allow-multiple-definition -Wl,-mllvm \ -Wl,-enable-licm-vrp submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 644.nab_s=peak: # C COPTIMIZE = $(OPT_ROOT_PEAK) -fstruct-layout=5 \ -mllvm -unroll-threshold=50 \ -mllvm -inline-threshold=1000 -fremap-arrays \ -mllvm -function-specialize -flv-function-specialization \ -mllvm -enable-gvn-hoist \ -mllvm -global-vectorize-slp=true \ -mllvm -enable-licm-vrp \ -mllvm -reduce-array-computations=3 \ -Wno-unused-command-line-argument \ -mllvm -do-block-reorder=aggressive EXTRA_LDFLAGS = -Wl,-mllvm -Wl,-do-block-reorder=aggressive \ -Wl,-mllvm -Wl,-region-vectorize \ -Wl,-mllvm -Wl,-function-specialize \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 LDCFLAGS = -Wl,-allow-multiple-definition \ -Wl,-mllvm -Wl,-enable-licm-vrp submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 648.exchange2_s=peak: # Fortran submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 649.fotonik3d_s=peak: # Fortran 95 + OpenMP ENV_PGHPF_ZMEM =yes submit = echo never > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 654.roms_s=peak: # Fortran 2003 FOPTIMIZE = -Ofast $(OPT_ROOT) -Mrecursive \ -mllvm -reduce-array-computations=3 \ -mllvm -global-vectorize-slp=true \ -mllvm -enable-licm-vrp \ -Wno-unused-command-line-argument submit = echo never > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ ################################################################################ 657.xz_s=peak: # C99 COPTIMIZE = $(OPT_ROOT_PEAK) -fstruct-layout=5 \ -mllvm -unroll-threshold=50 \ -mllvm -inline-threshold=1000 -fremap-arrays \ -mllvm -function-specialize -flv-function-specialization \ -mllvm -enable-gvn-hoist \ -mllvm -global-vectorize-slp=true \ -mllvm -enable-licm-vrp \ -mllvm -reduce-array-computations=3 \ -Wno-unused-command-line-argument \ -mllvm -do-block-reorder=aggressive EXTRA_LDFLAGS = -Wl,-mllvm -Wl,-do-block-reorder=aggressive \ -Wl,-mllvm -Wl,-region-vectorize \ -Wl,-mllvm -Wl,-function-specialize \ -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6 \ -Wl,-mllvm -Wl,-reduce-array-computations=3 LDCFLAGS = -Wl,-allow-multiple-definition \ -Wl,-mllvm -Wl,-enable-licm-vrp ENV_LIBOMP_NUM_HIDDEN_HELPER_THREADS = 0 submit = echo always > /sys/kernel/mm/transparent_hugepage/enabled; $BIND $command ################################################################################ # The following settings were obtained by running the sysinfo_program # 'specperl $[top]/bin/sysinfo' (sysinfo:SHA:679c83684f6f4fc369a093999b6661d0a378911de2a006d3245423ad80d3fb9a) default: notes_plat_sysinfo_000 = notes_plat_sysinfo_005 = Sysinfo program /home/cpu2017/bin/sysinfo notes_plat_sysinfo_010 = Rev: r6622 of 2021-04-07 982a61ec0915b55891ef0e16acafc64d notes_plat_sysinfo_015 = running on localhost Fri Oct 7 08:40:19 2022 notes_plat_sysinfo_020 = notes_plat_sysinfo_025 = SUT (System Under Test) info as seen by some common utilities. notes_plat_sysinfo_030 = For more information on this section, see notes_plat_sysinfo_035 = https://www.spec.org/cpu2017/Docs/config.html#sysinfo notes_plat_sysinfo_040 = notes_plat_sysinfo_045 = From /proc/cpuinfo notes_plat_sysinfo_050 = model name : AMD EPYC 7662 64-Core Processor notes_plat_sysinfo_055 = 2 "physical id"s (chips) notes_plat_sysinfo_060 = 128 "processors" notes_plat_sysinfo_065 = cores, siblings (Caution: counting these is hw and system dependent. The following notes_plat_sysinfo_070 = excerpts from /proc/cpuinfo might not be reliable. Use with caution.) notes_plat_sysinfo_075 = cpu cores : 64 notes_plat_sysinfo_080 = siblings : 64 notes_plat_sysinfo_085 = physical 0: cores 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 notes_plat_sysinfo_090 = 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 notes_plat_sysinfo_095 = 53 54 55 56 57 58 59 60 61 62 63 notes_plat_sysinfo_100 = physical 1: cores 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 notes_plat_sysinfo_105 = 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 notes_plat_sysinfo_110 = 53 54 55 56 57 58 59 60 61 62 63 notes_plat_sysinfo_115 = notes_plat_sysinfo_120 = From lscpu from util-linux 2.33.1: notes_plat_sysinfo_125 = Architecture: x86_64 notes_plat_sysinfo_130 = CPU op-mode(s): 32-bit, 64-bit notes_plat_sysinfo_135 = Byte Order: Little Endian notes_plat_sysinfo_140 = Address sizes: 43 bits physical, 48 bits virtual notes_plat_sysinfo_145 = CPU(s): 128 notes_plat_sysinfo_150 = On-line CPU(s) list: 0-127 notes_plat_sysinfo_155 = Thread(s) per core: 1 notes_plat_sysinfo_160 = Core(s) per socket: 64 notes_plat_sysinfo_165 = Socket(s): 2 notes_plat_sysinfo_170 = NUMA node(s): 32 notes_plat_sysinfo_175 = Vendor ID: AuthenticAMD notes_plat_sysinfo_180 = CPU family: 23 notes_plat_sysinfo_185 = Model: 49 notes_plat_sysinfo_190 = Model name: AMD EPYC 7662 64-Core Processor notes_plat_sysinfo_195 = Stepping: 0 notes_plat_sysinfo_200 = CPU MHz: 1664.615 notes_plat_sysinfo_205 = CPU max MHz: 2000.0000 notes_plat_sysinfo_210 = CPU min MHz: 1500.0000 notes_plat_sysinfo_215 = BogoMIPS: 3992.42 notes_plat_sysinfo_220 = Virtualization: AMD-V notes_plat_sysinfo_225 = L1d cache: 32K notes_plat_sysinfo_230 = L1i cache: 32K notes_plat_sysinfo_235 = L2 cache: 512K notes_plat_sysinfo_240 = L3 cache: 16384K notes_plat_sysinfo_245 = NUMA node0 CPU(s): 0-3 notes_plat_sysinfo_250 = NUMA node1 CPU(s): 4-7 notes_plat_sysinfo_255 = NUMA node2 CPU(s): 8-11 notes_plat_sysinfo_260 = NUMA node3 CPU(s): 12-15 notes_plat_sysinfo_265 = NUMA node4 CPU(s): 16-19 notes_plat_sysinfo_270 = NUMA node5 CPU(s): 20-23 notes_plat_sysinfo_275 = NUMA node6 CPU(s): 24-27 notes_plat_sysinfo_280 = NUMA node7 CPU(s): 28-31 notes_plat_sysinfo_285 = NUMA node8 CPU(s): 32-35 notes_plat_sysinfo_290 = NUMA node9 CPU(s): 36-39 notes_plat_sysinfo_295 = NUMA node10 CPU(s): 40-43 notes_plat_sysinfo_300 = NUMA node11 CPU(s): 44-47 notes_plat_sysinfo_305 = NUMA node12 CPU(s): 48-51 notes_plat_sysinfo_310 = NUMA node13 CPU(s): 52-55 notes_plat_sysinfo_315 = NUMA node14 CPU(s): 56-59 notes_plat_sysinfo_320 = NUMA node15 CPU(s): 60-63 notes_plat_sysinfo_325 = NUMA node16 CPU(s): 64-67 notes_plat_sysinfo_330 = NUMA node17 CPU(s): 68-71 notes_plat_sysinfo_335 = NUMA node18 CPU(s): 72-75 notes_plat_sysinfo_340 = NUMA node19 CPU(s): 76-79 notes_plat_sysinfo_345 = NUMA node20 CPU(s): 80-83 notes_plat_sysinfo_350 = NUMA node21 CPU(s): 84-87 notes_plat_sysinfo_355 = NUMA node22 CPU(s): 88-91 notes_plat_sysinfo_360 = NUMA node23 CPU(s): 92-95 notes_plat_sysinfo_365 = NUMA node24 CPU(s): 96-99 notes_plat_sysinfo_370 = NUMA node25 CPU(s): 100-103 notes_plat_sysinfo_375 = NUMA node26 CPU(s): 104-107 notes_plat_sysinfo_380 = NUMA node27 CPU(s): 108-111 notes_plat_sysinfo_385 = NUMA node28 CPU(s): 112-115 notes_plat_sysinfo_390 = NUMA node29 CPU(s): 116-119 notes_plat_sysinfo_395 = NUMA node30 CPU(s): 120-123 notes_plat_sysinfo_400 = NUMA node31 CPU(s): 124-127 notes_plat_sysinfo_405 = Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov notes_plat_sysinfo_410 = pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm notes_plat_sysinfo_415 = constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq notes_plat_sysinfo_420 = monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm notes_plat_sysinfo_425 = cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs notes_plat_sysinfo_430 = skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 notes_plat_sysinfo_435 = cdp_l3 hw_pstate sme ssbd mba sev ibrs ibpb stibp vmmcall fsgsbase bmi1 avx2 smep notes_plat_sysinfo_440 = bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves notes_plat_sysinfo_445 = cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr wbnoinvd notes_plat_sysinfo_450 = arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists notes_plat_sysinfo_455 = pausefilter pfthreshold avic v_vmsave_vmload vgif umip rdpid overflow_recov succor notes_plat_sysinfo_460 = smca notes_plat_sysinfo_465 = notes_plat_sysinfo_470 = /proc/cpuinfo cache data notes_plat_sysinfo_475 = cache size : 512 KB notes_plat_sysinfo_480 = notes_plat_sysinfo_485 = From numactl --hardware notes_plat_sysinfo_490 = WARNING: a numactl 'node' might or might not correspond to a physical chip. notes_plat_sysinfo_495 = available: 32 nodes (0-31) notes_plat_sysinfo_500 = node 0 cpus: 0 1 2 3 notes_plat_sysinfo_505 = node 0 size: 64325 MB notes_plat_sysinfo_510 = node 0 free: 64243 MB notes_plat_sysinfo_515 = node 1 cpus: 4 5 6 7 notes_plat_sysinfo_520 = node 1 size: 64510 MB notes_plat_sysinfo_525 = node 1 free: 64389 MB notes_plat_sysinfo_530 = node 2 cpus: 8 9 10 11 notes_plat_sysinfo_535 = node 2 size: 64510 MB notes_plat_sysinfo_540 = node 2 free: 64431 MB notes_plat_sysinfo_545 = node 3 cpus: 12 13 14 15 notes_plat_sysinfo_550 = node 3 size: 64510 MB notes_plat_sysinfo_555 = node 3 free: 64449 MB notes_plat_sysinfo_560 = node 4 cpus: 16 17 18 19 notes_plat_sysinfo_565 = node 4 size: 64477 MB notes_plat_sysinfo_570 = node 4 free: 64376 MB notes_plat_sysinfo_575 = node 5 cpus: 20 21 22 23 notes_plat_sysinfo_580 = node 5 size: 64510 MB notes_plat_sysinfo_585 = node 5 free: 64449 MB notes_plat_sysinfo_590 = node 6 cpus: 24 25 26 27 notes_plat_sysinfo_595 = node 6 size: 64510 MB notes_plat_sysinfo_600 = node 6 free: 64365 MB notes_plat_sysinfo_605 = node 7 cpus: 28 29 30 31 notes_plat_sysinfo_610 = node 7 size: 64510 MB notes_plat_sysinfo_615 = node 7 free: 64457 MB notes_plat_sysinfo_620 = node 8 cpus: 32 33 34 35 notes_plat_sysinfo_625 = node 8 size: 64510 MB notes_plat_sysinfo_630 = node 8 free: 64456 MB notes_plat_sysinfo_635 = node 9 cpus: 36 37 38 39 notes_plat_sysinfo_640 = node 9 size: 64510 MB notes_plat_sysinfo_645 = node 9 free: 64465 MB notes_plat_sysinfo_650 = node 10 cpus: 40 41 42 43 notes_plat_sysinfo_655 = node 10 size: 64510 MB notes_plat_sysinfo_660 = node 10 free: 64463 MB notes_plat_sysinfo_665 = node 11 cpus: 44 45 46 47 notes_plat_sysinfo_670 = node 11 size: 64510 MB notes_plat_sysinfo_675 = node 11 free: 64464 MB notes_plat_sysinfo_680 = node 12 cpus: 48 49 50 51 notes_plat_sysinfo_685 = node 12 size: 64510 MB notes_plat_sysinfo_690 = node 12 free: 64466 MB notes_plat_sysinfo_695 = node 13 cpus: 52 53 54 55 notes_plat_sysinfo_700 = node 13 size: 64510 MB notes_plat_sysinfo_705 = node 13 free: 64465 MB notes_plat_sysinfo_710 = node 14 cpus: 56 57 58 59 notes_plat_sysinfo_715 = node 14 size: 64510 MB notes_plat_sysinfo_720 = node 14 free: 64465 MB notes_plat_sysinfo_725 = node 15 cpus: 60 61 62 63 notes_plat_sysinfo_730 = node 15 size: 52398 MB notes_plat_sysinfo_735 = node 15 free: 52353 MB notes_plat_sysinfo_740 = node 16 cpus: 64 65 66 67 notes_plat_sysinfo_745 = node 16 size: 64510 MB notes_plat_sysinfo_750 = node 16 free: 64462 MB notes_plat_sysinfo_755 = node 17 cpus: 68 69 70 71 notes_plat_sysinfo_760 = node 17 size: 64510 MB notes_plat_sysinfo_765 = node 17 free: 64462 MB notes_plat_sysinfo_770 = node 18 cpus: 72 73 74 75 notes_plat_sysinfo_775 = node 18 size: 64510 MB notes_plat_sysinfo_780 = node 18 free: 64459 MB notes_plat_sysinfo_785 = node 19 cpus: 76 77 78 79 notes_plat_sysinfo_790 = node 19 size: 64510 MB notes_plat_sysinfo_795 = node 19 free: 64465 MB notes_plat_sysinfo_800 = node 20 cpus: 80 81 82 83 notes_plat_sysinfo_805 = node 20 size: 64510 MB notes_plat_sysinfo_810 = node 20 free: 64465 MB notes_plat_sysinfo_815 = node 21 cpus: 84 85 86 87 notes_plat_sysinfo_820 = node 21 size: 64510 MB notes_plat_sysinfo_825 = node 21 free: 64465 MB notes_plat_sysinfo_830 = node 22 cpus: 88 89 90 91 notes_plat_sysinfo_835 = node 22 size: 64510 MB notes_plat_sysinfo_840 = node 22 free: 64466 MB notes_plat_sysinfo_845 = node 23 cpus: 92 93 94 95 notes_plat_sysinfo_850 = node 23 size: 64510 MB notes_plat_sysinfo_855 = node 23 free: 64465 MB notes_plat_sysinfo_860 = node 24 cpus: 96 97 98 99 notes_plat_sysinfo_865 = node 24 size: 64510 MB notes_plat_sysinfo_870 = node 24 free: 64443 MB notes_plat_sysinfo_875 = node 25 cpus: 100 101 102 103 notes_plat_sysinfo_880 = node 25 size: 64510 MB notes_plat_sysinfo_885 = node 25 free: 64461 MB notes_plat_sysinfo_890 = node 26 cpus: 104 105 106 107 notes_plat_sysinfo_895 = node 26 size: 64510 MB notes_plat_sysinfo_900 = node 26 free: 64362 MB notes_plat_sysinfo_905 = node 27 cpus: 108 109 110 111 notes_plat_sysinfo_910 = node 27 size: 64510 MB notes_plat_sysinfo_915 = node 27 free: 64457 MB notes_plat_sysinfo_920 = node 28 cpus: 112 113 114 115 notes_plat_sysinfo_925 = node 28 size: 64510 MB notes_plat_sysinfo_930 = node 28 free: 64460 MB notes_plat_sysinfo_935 = node 29 cpus: 116 117 118 119 notes_plat_sysinfo_940 = node 29 size: 64510 MB notes_plat_sysinfo_945 = node 29 free: 64412 MB notes_plat_sysinfo_950 = node 30 cpus: 120 121 122 123 notes_plat_sysinfo_955 = node 30 size: 64510 MB notes_plat_sysinfo_960 = node 30 free: 64464 MB notes_plat_sysinfo_965 = node 31 cpus: 124 125 126 127 notes_plat_sysinfo_970 = node 31 size: 64505 MB notes_plat_sysinfo_975 = node 31 free: 64458 MB notes_plat_sysinfo_980 = node distances: notes_plat_sysinfo_985 = node 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 notes_plat_sysinfo_990 = 20 21 22 23 24 25 26 27 28 29 30 31 notes_plat_sysinfo_995 = 0: 10 11 11 11 11 11 11 11 11 11 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1000 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1005 = 1: 11 10 11 11 11 11 11 11 11 11 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1010 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1015 = 2: 11 11 10 11 11 11 11 11 11 11 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1020 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1025 = 3: 11 11 11 10 11 11 11 11 11 11 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1030 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1035 = 4: 11 11 11 11 10 11 11 11 11 11 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1040 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1045 = 5: 11 11 11 11 11 10 11 11 11 11 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1050 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1055 = 6: 11 11 11 11 11 11 10 11 11 11 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1060 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1065 = 7: 11 11 11 11 11 11 11 10 11 11 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1070 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1075 = 8: 11 11 11 11 11 11 11 11 10 11 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1080 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1085 = 9: 11 11 11 11 11 11 11 11 11 10 11 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1090 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1095 = 10: 11 11 11 11 11 11 11 11 11 11 10 11 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1100 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1105 = 11: 11 11 11 11 11 11 11 11 11 11 11 10 11 11 11 11 32 32 32 32 notes_plat_sysinfo_1110 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1115 = 12: 11 11 11 11 11 11 11 11 11 11 11 11 10 11 11 11 32 32 32 32 notes_plat_sysinfo_1120 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1125 = 13: 11 11 11 11 11 11 11 11 11 11 11 11 11 10 11 11 32 32 32 32 notes_plat_sysinfo_1130 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1135 = 14: 11 11 11 11 11 11 11 11 11 11 11 11 11 11 10 11 32 32 32 32 notes_plat_sysinfo_1140 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1145 = 15: 11 11 11 11 11 11 11 11 11 11 11 11 11 11 11 10 32 32 32 32 notes_plat_sysinfo_1150 = 32 32 32 32 32 32 32 32 32 32 32 32 notes_plat_sysinfo_1155 = 16: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 10 11 11 11 notes_plat_sysinfo_1160 = 11 11 11 11 11 11 11 11 11 11 11 11 notes_plat_sysinfo_1165 = 17: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 10 11 11 notes_plat_sysinfo_1170 = 11 11 11 11 11 11 11 11 11 11 11 11 notes_plat_sysinfo_1175 = 18: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 10 11 notes_plat_sysinfo_1180 = 11 11 11 11 11 11 11 11 11 11 11 11 notes_plat_sysinfo_1185 = 19: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 10 notes_plat_sysinfo_1190 = 11 11 11 11 11 11 11 11 11 11 11 11 notes_plat_sysinfo_1195 = 20: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1200 = 10 11 11 11 11 11 11 11 11 11 11 11 notes_plat_sysinfo_1205 = 21: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1210 = 11 10 11 11 11 11 11 11 11 11 11 11 notes_plat_sysinfo_1215 = 22: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1220 = 11 11 10 11 11 11 11 11 11 11 11 11 notes_plat_sysinfo_1225 = 23: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1230 = 11 11 11 10 11 11 11 11 11 11 11 11 notes_plat_sysinfo_1235 = 24: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1240 = 11 11 11 11 10 11 11 11 11 11 11 11 notes_plat_sysinfo_1245 = 25: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1250 = 11 11 11 11 11 10 11 11 11 11 11 11 notes_plat_sysinfo_1255 = 26: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1260 = 11 11 11 11 11 11 10 11 11 11 11 11 notes_plat_sysinfo_1265 = 27: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1270 = 11 11 11 11 11 11 11 10 11 11 11 11 notes_plat_sysinfo_1275 = 28: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1280 = 11 11 11 11 11 11 11 11 10 11 11 11 notes_plat_sysinfo_1285 = 29: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1290 = 11 11 11 11 11 11 11 11 11 10 11 11 notes_plat_sysinfo_1295 = 30: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1300 = 11 11 11 11 11 11 11 11 11 11 10 11 notes_plat_sysinfo_1305 = 31: 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 11 11 11 11 notes_plat_sysinfo_1310 = 11 11 11 11 11 11 11 11 11 11 11 10 notes_plat_sysinfo_1315 = notes_plat_sysinfo_1320 = From /proc/meminfo notes_plat_sysinfo_1325 = MemTotal: 2101254440 kB notes_plat_sysinfo_1330 = HugePages_Total: 0 notes_plat_sysinfo_1335 = Hugepagesize: 2048 kB notes_plat_sysinfo_1340 = notes_plat_sysinfo_1345 = /sys/devices/system/cpu/cpu*/cpufreq/scaling_governor has notes_plat_sysinfo_1350 = performance notes_plat_sysinfo_1355 = notes_plat_sysinfo_1360 = From /etc/*release* /etc/*version* notes_plat_sysinfo_1365 = os-release: notes_plat_sysinfo_1370 = NAME="SLES" notes_plat_sysinfo_1375 = VERSION="15-SP2" notes_plat_sysinfo_1380 = VERSION_ID="15.2" notes_plat_sysinfo_1385 = PRETTY_NAME="SUSE Linux Enterprise Server 15 SP2" notes_plat_sysinfo_1390 = ID="sles" notes_plat_sysinfo_1395 = ID_LIKE="suse" notes_plat_sysinfo_1400 = ANSI_COLOR="0;32" notes_plat_sysinfo_1405 = CPE_NAME="cpe:/o:suse:sles:15:sp2" notes_plat_sysinfo_1410 = notes_plat_sysinfo_1415 = uname -a: notes_plat_sysinfo_1420 = Linux localhost 5.3.18-22-default #1 SMP Wed Jun 3 12:16:43 UTC 2020 (720aeba) x86_64 notes_plat_sysinfo_1425 = x86_64 x86_64 GNU/Linux notes_plat_sysinfo_1430 = notes_plat_sysinfo_1435 = Kernel self-reported vulnerability status: notes_plat_sysinfo_1440 = notes_plat_sysinfo_1445 = CVE-2018-12207 (iTLB Multihit): Not affected notes_plat_sysinfo_1450 = CVE-2018-3620 (L1 Terminal Fault): Not affected notes_plat_sysinfo_1455 = Microarchitectural Data Sampling: Not affected notes_plat_sysinfo_1460 = CVE-2017-5754 (Meltdown): Not affected notes_plat_sysinfo_1465 = CVE-2018-3639 (Speculative Store Bypass): Mitigation: Speculative Store notes_plat_sysinfo_1470 = Bypass disabled via prctl and notes_plat_sysinfo_1475 = seccomp notes_plat_sysinfo_1480 = CVE-2017-5753 (Spectre variant 1): Mitigation: usercopy/swapgs notes_plat_sysinfo_1485 = barriers and __user pointer notes_plat_sysinfo_1490 = sanitization notes_plat_sysinfo_1495 = CVE-2017-5715 (Spectre variant 2): Mitigation: Full AMD retpoline, notes_plat_sysinfo_1500 = IBPB: conditional, IBRS_FW, STIBP: notes_plat_sysinfo_1505 = disabled, RSB filling notes_plat_sysinfo_1510 = CVE-2020-0543 (Special Register Buffer Data Sampling): Not affected notes_plat_sysinfo_1515 = CVE-2019-11135 (TSX Asynchronous Abort): Not affected notes_plat_sysinfo_1520 = notes_plat_sysinfo_1525 = run-level 3 Apr 17 06:12 notes_plat_sysinfo_1530 = notes_plat_sysinfo_1535 = SPEC is set to: /home/cpu2017 notes_plat_sysinfo_1540 = Filesystem Type Size Used Avail Use% Mounted on notes_plat_sysinfo_1545 = /dev/sda2 xfs 223G 11G 213G 5% / notes_plat_sysinfo_1550 = notes_plat_sysinfo_1555 = From /sys/devices/virtual/dmi/id notes_plat_sysinfo_1560 = Vendor: Cisco Systems Inc notes_plat_sysinfo_1565 = Product: UCSC-C225-M6S notes_plat_sysinfo_1570 = Serial: WZP252408JE notes_plat_sysinfo_1575 = notes_plat_sysinfo_1580 = Additional information from dmidecode 3.2 follows. WARNING: Use caution when you notes_plat_sysinfo_1585 = interpret this section. The 'dmidecode' program reads system data which is "intended to notes_plat_sysinfo_1590 = allow hardware to be accurately determined", but the intent may not be met, as there are notes_plat_sysinfo_1595 = frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard. notes_plat_sysinfo_1600 = Memory: notes_plat_sysinfo_1605 = 16x 0xCE00 M386AAG40AM3-CWE 128 GB 4 rank 3200 notes_plat_sysinfo_1610 = 16x Unknown Unknown notes_plat_sysinfo_1615 = notes_plat_sysinfo_1620 = BIOS: notes_plat_sysinfo_1625 = BIOS Vendor: Cisco Systems, Inc. notes_plat_sysinfo_1630 = BIOS Version: C225M6.4.2.2b.0.0509222122 notes_plat_sysinfo_1635 = BIOS Date: 05/09/2022 notes_plat_sysinfo_1640 = BIOS Revision: 5.14 notes_plat_sysinfo_1645 = notes_plat_sysinfo_1650 = (End of data from sysinfo program) hw_cpu_name = AMD EPYC 7662 hw_disk = 223 GB add more disk info here hw_memory001 = 2003.912 GB fixme: If using DDR4, the format is: hw_memory002 = 'N GB (N x N GB nRxn PC4-nnnnX-X)' hw_nchips = 2 prepared_by = root (is never output, only tags rawfile) sw_file = xfs sw_os001 = NAME="SLES" sw_os002 = 5.3.18-22-default sw_state = Run level 3 (add definition here) # End of settings added by sysinfo_program 648.exchange2_s: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 641.leela_s: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 631.deepsjeng_s: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 625.x264_s: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 620.omnetpp_s: # The following setting was inserted automatically as a result of # post-run basepeak application. basepeak = 1 # The following section was added automatically, and contains settings that # did not appear in the original configuration file, but were added to the # raw file after the run. default: power_management000 = BIOS and OS set to prefer performance at the cost power_management001 = of additional power usage notes_plat_000 =BIOS Configuration notes_plat_005 = SMT Mode set to Disabled notes_plat_010 = NUMA nodes per socket set to NPS1 notes_plat_015 = ACPI SRAT L3 Cache As NUMA Domain set to Enabled notes_plat_020 = DRAM Scrub Time set to Disabled notes_plat_025 = Determinism Slider set to Power notes_plat_030 = L1 Stream HW Prefetcher set to Enabled notes_plat_035 = APBDIS set to 1