CPU2017 Result Flag Description

Base Portability Flags

503.bwaves_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.
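
As a rough illustration of the LP64 data model described above (32-bit int, 64-bit long and pointers), the following sketch classifies a platform from its type widths. It queries the host C types through Python's ctypes module; the function names `classify_data_model` and `host_data_model` are hypothetical and are not part of SPEC CPU2017.

```python
import ctypes


def classify_data_model(int_bits, long_bits, pointer_bits):
    """Name the C data model implied by the given type widths (in bits)."""
    widths = (int_bits, long_bits, pointer_bits)
    if widths == (32, 64, 64):
        return "LP64"    # the model that -DSPEC_LP64 declares
    if widths == (32, 32, 64):
        return "LLP64"   # 32-bit long with 64-bit pointers
    if widths == (32, 32, 32):
        return "ILP32"   # int, long, and pointers all 32 bits
    return "other"


def host_data_model():
    """Classify the machine running this script from its C type sizes."""
    return classify_data_model(
        8 * ctypes.sizeof(ctypes.c_int),
        8 * ctypes.sizeof(ctypes.c_long),
        8 * ctypes.sizeof(ctypes.c_void_p),
    )


if __name__ == "__main__":
    print(host_data_model())
```

On a system where -DSPEC_LP64 is appropriate, `host_data_model()` would be expected to report "LP64".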

507.cactuBSSN_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

508.namd_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

510.parest_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

511.povray_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

519.lbm_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

521.wrf_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.
- -DSPEC_CASE_FLAG
- CPORTABILITY
- This macro indicates that Fortran functions called from C should have their names lower-cased.
- -convert big_endian
- FPORTABILITY
- Specifies that the format will be big endian for INTEGER*1, INTEGER*2, INTEGER*4, or INTEGER*8, and big endian IEEE floating-point for REAL*4, REAL*8, REAL*16, COMPLEX*8, COMPLEX*16, or COMPLEX*32.
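
To make the byte-order effect of -convert big_endian more concrete, the sketch below packs a 32-bit integer and a 64-bit IEEE floating-point value in both byte orders using Python's struct module. It only illustrates the byte order of individual values and does not model the Fortran unformatted record layout; the helper names are hypothetical.

```python
import struct


def encode_int32(value, big_endian=True):
    """Pack a 32-bit signed integer (like INTEGER*4) in the given byte order."""
    fmt = ">i" if big_endian else "<i"
    return struct.pack(fmt, value)


def encode_real64(value, big_endian=True):
    """Pack an IEEE 754 double (like REAL*8) in the given byte order."""
    fmt = ">d" if big_endian else "<d"
    return struct.pack(fmt, value)


if __name__ == "__main__":
    print(encode_int32(1).hex())                    # 00000001
    print(encode_int32(1, big_endian=False).hex())  # 01000000
    print(encode_real64(1.0).hex())                 # 3ff0000000000000
```

The same value yields byte-reversed encodings in the two orders, which is why data files written in one convention cannot be read correctly in the other without conversion.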

526.blender_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.
- -DSPEC_LINUX
- CPORTABILITY
- Defines the SPEC_LINUX macro, which selects Linux-specific portability code in the benchmark source.
- -funsigned-char
- CPORTABILITY
- Change default char type to unsigned.
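
The effect of -funsigned-char can be sketched by interpreting the same byte both ways. The example below is a minimal illustration in Python rather than C, with hypothetical helper names; it shows that values above 127 read differently depending on whether char is treated as signed or unsigned.

```python
def as_signed_char(byte):
    """Interpret one byte (0..255) as a signed 8-bit char."""
    return int.from_bytes(bytes([byte]), "little", signed=True)


def as_unsigned_char(byte):
    """Interpret one byte (0..255) as an unsigned 8-bit char."""
    return int.from_bytes(bytes([byte]), "little", signed=False)


if __name__ == "__main__":
    for byte in (0x41, 0x80, 0xFF):
        print(hex(byte), as_signed_char(byte), as_unsigned_char(byte))
```

Bytes in the range 0 to 127 are unaffected; only the upper half of the range changes meaning, which matters for code that compares or indexes with char values.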

527.cam4_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.
- -DSPEC_CASE_FLAG
- CPORTABILITY
- Fortran-to-C symbol naming: C symbol names are lower case with one underscore (_symbol).

538.imagick_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

544.nab_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

549.fotonik3d_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

554.roms_r

- -DSPEC_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32 bits wide, and longs and pointers are 64 bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

Base Optimization Flags

C benchmarks

- -w
- CC, LD
- Suppress compiler warnings.
- -std=c11
- intel_icc,intel_icx,intel_icpx
- CC, LD
- Sets the language dialect to conform to the indicated C standard.
- -m64
- intel_icc,intel_icpc,intel_ifort,intel_icx,intel_icpx,intel_ifx
- CC, LD
- Compiles for a 64-bit (LP64) data model.
- -Wl,-z,muldefs
- EXTRA_LDFLAGS
- Enable SmartHeap and/or other library usage by forcing the linker to ignore multiple definitions if present
- -xAVX
- COPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if the program will be executed on a processor that is not an Intel processor. If you use this option to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error when it is executed on an unsupported processor.
- -Ofast
- COPTIMIZE
- Enable O3 optimizations plus more aggressive optimizations, such as -ffinite-math-only and -no-prec-div.
- Includes:
  - -O3
    - -O2
      
      -O1
      
      -funroll-loops
      
      -fno-builtin
      
      -mno-ieee-fp
      
      -fomit-framepointer
      
      -ffunction-sections
      
      -ftz
- -ffast-math
- COPTIMIZE
- Enable fast math mode. This option may yield faster code for programs that do not require the guarantees of exact implementation of IEEE or ISO rules/specifications for math functions.
- -flto
- COPTIMIZE
- Performs link-time optimization, also known as interprocedural optimization.
- -mfpmath=sse
- COPTIMIZE
- Generate floating-point arithmetic for the selected unit. Here, use the scalar floating-point instructions present in the SSE instruction set.
- -funroll-loops
- COPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -funroll-loops0 would disable unrolling of loops.
- -qopt-mem-layout-trans=4
- COPTIMIZE
- Controls the level of memory layout transformations performed by the compiler. This option can improve cache reuse and cache locality.
  - 0: Disables memory layout transformations. This is the same as specifying -qno-opt-mem-layout-trans
  - 1: Enable basic memory layout transformations like structure splitting, structure peeling, field inlining, field reordering, array field transpose, increase field alignment etc.
  - 2: Enable more memory layout transformations like advanced structure splitting. This is the same as specifying -qopt-mem-layout-trans
  - 3: Enable more memory layout transformations like copy-in/copy-out of structures for a region of code. You should only use this setting if your system has more than 4GB of physical memory per core.
  - 4: Compiler is more aggressive in using memory layout transformations. You should only use this setting if your system has more than 4GB of physical memory per core.
- -Wno-implicit-int
- EXTRA_CFLAGS
- -Wno-implicit-int is needed to allow the compiler to accept invalid C code where the type specifier is missing. With this diagnostic disabled, the missing type will be interpreted as `int`, as in C89 (the last version of C in which implicit type specifiers were allowed).
- -ljemalloc
- EXTRA_LIBS
- Linker toggle to specify jemalloc linker library. See jemalloc.net for more information.
- -L/home/cpu2017/je5.0.1-64/
- EXTRA_LIBS
- Specify build time link path for jemalloc 64bit built to support the CPU 2017 build. See jemalloc.net for more information.
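
The -ffast-math entry above mentions giving up exact IEEE/ISO guarantees. One reason results can change is that floating-point addition is not associative, so a compiler that is allowed to reorder operations may produce a slightly different sum. The following is a minimal sketch of that effect in Python, not a model of what any particular compiler does; `sum_in_order` is a hypothetical helper.

```python
def sum_in_order(values):
    """Add the values strictly left to right, as written."""
    total = 0.0
    for v in values:
        total += v
    return total


if __name__ == "__main__":
    values = [0.1, 0.2, 0.3]
    forward = sum_in_order(values)
    backward = sum_in_order(list(reversed(values)))
    print(forward)              # 0.6000000000000001
    print(backward)             # 0.6
    print(forward == backward)  # False
```

The difference is tiny here, but benchmarks that validate output against tight tolerances can be sensitive to such reordering.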

C++ benchmarks

- -w
- CXX, LD
- Suppress compiler warnings.
- -std=c++14
- intel_icpc,intel_icx,intel_icpx
- CXX, LD
- Sets the language dialect to conform to the indicated C++ standard.
- -m64
- intel_icc,intel_icpc,intel_ifort,intel_icx,intel_icpx,intel_ifx
- CXX, LD
- Compiles for a 64-bit (LP64) data model.
- -Wl,-z,muldefs
- EXTRA_LDFLAGS
- Enable SmartHeap and/or other library usage by forcing the linker to ignore multiple definitions if present
- -xAVX
- CXXOPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if the program will be executed on a processor that is not an Intel processor. If you use this option to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error when it is executed on an unsupported processor.
- -Ofast
- CXXOPTIMIZE
- Enable O3 optimizations plus more aggressive optimizations, such as -ffinite-math-only and -no-prec-div.
- Includes:
  - -O3
    - -O2
      
      -O1
      
      -funroll-loops
      
      -fno-builtin
      
      -mno-ieee-fp
      
      -fomit-framepointer
      
      -ffunction-sections
      
      -ftz
- -ffast-math
- CXXOPTIMIZE
- Enable fast math mode. This option may yield faster code for programs that do not require the guarantees of exact implementation of IEEE or ISO rules/specifications for math functions.
- -flto
- CXXOPTIMIZE
- Performs link-time optimization, also known as interprocedural optimization.
- -mfpmath=sse
- CXXOPTIMIZE
- Generate floating-point arithmetic for the selected unit. Here, use the scalar floating-point instructions present in the SSE instruction set.
- -funroll-loops
- CXXOPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -funroll-loops0 would disable unrolling of loops.
- -qopt-mem-layout-trans=4
- CXXOPTIMIZE
- Controls the level of memory layout transformations performed by the compiler. This option can improve cache reuse and cache locality.
  - 0: Disables memory layout transformations. This is the same as specifying -qno-opt-mem-layout-trans
  - 1: Enable basic memory layout transformations like structure splitting, structure peeling, field inlining, field reordering, array field transpose, increase field alignment etc.
  - 2: Enable more memory layout transformations like advanced structure splitting. This is the same as specifying -qopt-mem-layout-trans
  - 3: Enable more memory layout transformations like copy-in/copy-out of structures for a region of code. You should only use this setting if your system has more than 4GB of physical memory per core.
  - 4: Compiler is more aggressive in using memory layout transformations. You should only use this setting if your system has more than 4GB of physical memory per core.
- -ljemalloc
- EXTRA_LIBS
- Linker toggle to specify jemalloc linker library. See jemalloc.net for more information.
- -L/home/cpu2017/je5.0.1-64/
- EXTRA_LIBS
- Specify build time link path for jemalloc 64bit built to support the CPU 2017 build. See jemalloc.net for more information.
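
As a sketch of what loop unrolling (see -funroll-loops above) means, the example below writes the same dot product as a plain loop and as a loop unrolled by hand with a factor of 4. The unroll factor and function names are assumptions chosen for illustration; a compiler performs this transformation on machine code and chooses its own factor.

```python
def dot_rolled(a, b):
    """Dot product with one multiply-add per loop trip."""
    total = 0.0
    for i in range(len(a)):
        total += a[i] * b[i]
    return total


def dot_unrolled_by_4(a, b):
    """Same computation with four multiply-adds per loop trip."""
    n = len(a)
    total = 0.0
    i = 0
    # Main body: four iterations' worth of work per trip, so the loop
    # condition and index update run a quarter as often.
    while i + 4 <= n:
        total += a[i] * b[i]
        total += a[i + 1] * b[i + 1]
        total += a[i + 2] * b[i + 2]
        total += a[i + 3] * b[i + 3]
        i += 4
    # Remainder loop: handle the last n % 4 elements.
    while i < n:
        total += a[i] * b[i]
        i += 1
    return total
```

Because the additions happen in the same order in both versions, the results are identical; only the loop overhead differs.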

Fortran benchmarks

- -w
- FC, LD
- Suppress compiler warnings.
- -m64
- intel_icc,intel_icpc,intel_ifort,intel_icx,intel_icpx,intel_ifx
- FC, LD
- Compiles for a 64-bit (LP64) data model.
- -Wl,-z,muldefs
- EXTRA_LDFLAGS
- Enable SmartHeap and/or other library usage by forcing the linker to ignore multiple definitions if present
- -xAVX
- FOPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if the program will be executed on a processor that is not an Intel processor. If you use this option to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error when it is executed on an unsupported processor.
- -Ofast
- FOPTIMIZE
- Enable O3 optimizations plus more aggressive optimizations, such as -ffinite-math-only and -no-prec-div.
- Includes:
  - -O3
    - -O2
      
      -O1
      
      -funroll-loops
      
      -fno-builtin
      
      -mno-ieee-fp
      
      -fomit-framepointer
      
      -ffunction-sections
      
      -ftz
- -ffast-math
- FOPTIMIZE
- Enable fast math mode. This option may yield faster code for programs that do not require the guarantees of exact implementation of IEEE or ISO rules/specifications for math functions.
- -flto
- FOPTIMIZE
- Performs link-time optimization, also known as interprocedural optimization.
- -mfpmath=sse
- FOPTIMIZE
- Generate floating-point arithmetic for the selected unit. Here, use the scalar floating-point instructions present in the SSE instruction set.
- -funroll-loops
- FOPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -funroll-loops0 would disable unrolling of loops.
- -qopt-mem-layout-trans=4
- FOPTIMIZE
- Controls the level of memory layout transformations performed by the compiler. This option can improve cache reuse and cache locality.
  - 0: Disables memory layout transformations. This is the same as specifying -qno-opt-mem-layout-trans
  - 1: Enable basic memory layout transformations like structure splitting, structure peeling, field inlining, field reordering, array field transpose, increase field alignment etc.
  - 2: Enable more memory layout transformations like advanced structure splitting. This is the same as specifying -qopt-mem-layout-trans
  - 3: Enable more memory layout transformations like copy-in/copy-out of structures for a region of code. You should only use this setting if your system has more than 4GB of physical memory per core.
  - 4: Compiler is more aggressive in using memory layout transformations. You should only use this setting if your system has more than 4GB of physical memory per core.
- -nostandard-realloc-lhs
- EXTRA_FOPTIMIZE
- Option standard-realloc-lhs (the default) tells the compiler that when the left-hand side of an assignment is an allocatable object, it should be reallocated to the shape of the right-hand side of the assignment before the assignment occurs. This is the current Fortran Standard definition. This feature may cause extra overhead at run time. This option has the same effect as option assume realloc_lhs.
  
  If you specify nostandard-realloc-lhs, the compiler uses the old Fortran 2003 rules when interpreting assignment statements. The left-hand side is assumed to be allocated with the correct shape to hold the right-hand side. If it is not, incorrect behavior will occur. This option has the same effect as option assume norealloc_lhs.
- -align array32byte
- EXTRA_FOPTIMIZE
- The align toggle changes how data elements are aligned. Variables and arrays are analyzed and memory layout can be altered. Specifying array32byte will look for opportunities to transform and realign arrays to 32-byte boundaries.
- -auto
- EXTRA_FOPTIMIZE
- Make all local variables AUTOMATIC. Same as -automatic
- -ljemalloc
- EXTRA_LIBS
- Linker toggle to specify jemalloc linker library. See jemalloc.net for more information.
- -L/home/cpu2017/je5.0.1-64/
- EXTRA_LIBS
- Specify build time link path for jemalloc 64bit built to support the CPU 2017 build. See jemalloc.net for more information.
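
The 32-byte boundaries mentioned for -align array32byte can be illustrated with simple address arithmetic. The sketch below rounds an address up to the next multiple of a power-of-two boundary; it is a generic alignment calculation with hypothetical function names, not a description of how the compiler places arrays.

```python
def align_up(address, boundary=32):
    """Round address up to the next multiple of boundary (a power of two)."""
    if boundary <= 0 or boundary & (boundary - 1):
        raise ValueError("boundary must be a positive power of two")
    return (address + boundary - 1) & ~(boundary - 1)


def is_aligned(address, boundary=32):
    """Return True if address is a multiple of boundary."""
    return address % boundary == 0


if __name__ == "__main__":
    for addr in (0, 1, 32, 33, 100):
        print(addr, align_up(addr), is_aligned(addr))
```

A 32-byte boundary matches the width of a 256-bit AVX register, which is consistent with this flag being used together with -xAVX.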

Benchmarks using both Fortran and C

- -w
- CC, FC, LD
- Suppress compiler warnings.
- -m64
- intel_icc,intel_icpc,intel_ifort,intel_icx,intel_icpx,intel_ifx
- CC, FC, LD
- Compiles for a 64-bit (LP64) data model.
- -std=c11
- intel_icc,intel_icx,intel_icpx
- CC
- Sets the language dialect to conform to the indicated C standard.
- -Wl,-z,muldefs
- EXTRA_LDFLAGS
- Enable SmartHeap and/or other library usage by forcing the linker to ignore multiple definitions if present
- -xAVX
- COPTIMIZE, FOPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if the program will be executed on a processor that is not an Intel processor. If you use this option to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error when it is executed on an unsupported processor.
- -Ofast
- COPTIMIZE, FOPTIMIZE
- Enable O3 optimizations plus more aggressive optimizations, such as -ffinite-math-only and -no-prec-div.
- Includes:
  - -O3
    - -O2
      
      -O1
      
      -funroll-loops
      
      -fno-builtin
      
      -mno-ieee-fp
      
      -fomit-framepointer
      
      -ffunction-sections
      
      -ftz
- -ffast-math
- COPTIMIZE, FOPTIMIZE
- Enable fast math mode. This option may yield faster code for programs that do not require the guarantees of exact implementation of IEEE or ISO rules/specifications for math functions.
- -flto
- COPTIMIZE, FOPTIMIZE
- Performs link-time optimization, also known as interprocedural optimization.
- -mfpmath=sse
- COPTIMIZE, FOPTIMIZE
- Generate floating-point arithmetic for the selected unit. Here, use the scalar floating-point instructions present in the SSE instruction set.
- -funroll-loops
- COPTIMIZE, FOPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -funroll-loops0 would disable unrolling of loops.
- -qopt-mem-layout-trans=4
- COPTIMIZE, FOPTIMIZE
- Controls the level of memory layout transformations performed by the compiler. This option can improve cache reuse and cache locality.
  - 0: Disables memory layout transformations. This is the same as specifying -qno-opt-mem-layout-trans
  - 1: Enable basic memory layout transformations like structure splitting, structure peeling, field inlining, field reordering, array field transpose, increase field alignment etc.
  - 2: Enable more memory layout transformations like advanced structure splitting. This is the same as specifying -qopt-mem-layout-trans
  - 3: Enable more memory layout transformations like copy-in/copy-out of structures for a region of code. You should only use this setting if your system has more than 4GB of physical memory per core.
  - 4: Compiler is more aggressive in using memory layout transformations. You should only use this setting if your system has more than 4GB of physical memory per core.
- -Wno-implicit-int
- EXTRA_CFLAGS
- -Wno-implicit-int is needed to allow the compiler to accept invalid C code where the type specifier is missing. With this diagnostic disabled, the missing type will be interpreted as `int`, as in C89 (the last version of C in which implicit type specifiers were allowed).
- -nostandard-realloc-lhs
- EXTRA_FOPTIMIZE
- Option standard-realloc-lhs (the default) tells the compiler that when the left-hand side of an assignment is an allocatable object, it should be reallocated to the shape of the right-hand side of the assignment before the assignment occurs. This is the current Fortran Standard definition. This feature may cause extra overhead at run time. This option has the same effect as option assume realloc_lhs.
  
  If you specify nostandard-realloc-lhs, the compiler uses the old Fortran 2003 rules when interpreting assignment statements. The left-hand side is assumed to be allocated with the correct shape to hold the right-hand side. If it is not, incorrect behavior will occur. This option has the same effect as option assume norealloc_lhs.
- -align array32byte
- EXTRA_FOPTIMIZE
- The align toggle changes how data elements are aligned. Variables and arrays are analyzed and memory layout can be altered. Specifying array32byte will look for opportunities to transform and realign arrays to 32-byte boundaries.
- -auto
- EXTRA_FOPTIMIZE
- Make all local variables AUTOMATIC. Same as -automatic
- -ljemalloc
- EXTRA_LIBS
- Linker toggle to specify jemalloc linker library. See jemalloc.net for more information.
- -L/home/cpu2017/je5.0.1-64/
- EXTRA_LIBS
- Specify build time link path for jemalloc 64bit built to support the CPU 2017 build. See jemalloc.net for more information.

Benchmarks using both C and C++

- -w
- CC, CXX, LD
- Suppress compiler warnings.
- -std=c++14
- intel_icpc,intel_icx,intel_icpx
- CXX, LD
- Sets the language dialect to conform to the indicated C++ standard.
- -m64
- intel_icc,intel_icpc,intel_ifort,intel_icx,intel_icpx,intel_ifx
- CC, CXX, LD
- Compiles for a 64-bit (LP64) data model.
- -std=c11
- intel_icc,intel_icx,intel_icpx
- CC
- Sets the language dialect to conform to the indicated C standard.
- -Wl,-z,muldefs
- EXTRA_LDFLAGS
- Enable SmartHeap and/or other library usage by forcing the linker to ignore multiple definitions if present
- -xAVX
- COPTIMIZE, CXXOPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if the program will be executed on a processor that is not an Intel processor. If you use this option to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error when it is executed on an unsupported processor.
- -Ofast
- COPTIMIZE, CXXOPTIMIZE
- Enable O3 optimizations plus more aggressive optimizations, such as -ffinite-math-only and -no-prec-div.
- Includes:
  - -O3
    - -O2
      
      -O1
      
      -funroll-loops
      
      -fno-builtin
      
      -mno-ieee-fp
      
      -fomit-framepointer
      
      -ffunction-sections
      
      -ftz
- -ffast-math
- COPTIMIZE, CXXOPTIMIZE
- Enable fast math mode. This option may yield faster code for programs that do not require the guarantees of exact implementation of IEEE or ISO rules/specifications for math functions.
- -flto
- COPTIMIZE, CXXOPTIMIZE
- Performs link-time optimization, also known as interprocedural optimization.
- -mfpmath=sse
- COPTIMIZE, CXXOPTIMIZE
- Generate floating-point arithmetic for the selected unit. Here, use the scalar floating-point instructions present in the SSE instruction set.
- -funroll-loops
- COPTIMIZE, CXXOPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -funroll-loops0 would disable unrolling of loops.
- -qopt-mem-layout-trans=4
- COPTIMIZE, CXXOPTIMIZE
- Controls the level of memory layout transformations performed by the compiler. This option can improve cache reuse and cache locality.
  - 0: Disables memory layout transformations. This is the same as specifying -qno-opt-mem-layout-trans
  - 1: Enable basic memory layout transformations like structure splitting, structure peeling, field inlining, field reordering, array field transpose, increase field alignment etc.
  - 2: Enable more memory layout transformations like advanced structure splitting. This is the same as specifying -qopt-mem-layout-trans
  - 3: Enable more memory layout transformations like copy-in/copy-out of structures for a region of code. You should only use this setting if your system has more than 4GB of physical memory per core.
  - 4: Compiler is more aggressive in using memory layout transformations. You should only use this setting if your system has more than 4GB of physical memory per core.
- -Wno-implicit-int
- EXTRA_CFLAGS
- -Wno-implicit-int is needed to allow the compiler to accept invalid C code where the type specifier is missing. With this diagnostic disabled, the missing type will be interpreted as `int`, as in C89 (the last version of C in which implicit type specifiers were allowed).
- -ljemalloc
- EXTRA_LIBS
- Linker toggle to specify jemalloc linker library. See jemalloc.net for more information.
- -L/home/cpu2017/je5.0.1-64/
- EXTRA_LIBS
- Specify build time link path for jemalloc 64bit built to support the CPU 2017 build. See jemalloc.net for more information.

Benchmarks using Fortran, C, and C++

- -w
- CC, CXX, FC, LD
- Suppress compiler warnings.
- -std=c++14
- intel_icpc,intel_icx,intel_icpx
- CXX, LD
- Sets the language dialect to conform to the indicated C++ standard.
- -m64
- intel_icc,intel_icpc,intel_ifort,intel_icx,intel_icpx,intel_ifx
- CC, CXX, FC, LD
- Compiles for a 64-bit (LP64) data model.
- -std=c11
- intel_icc,intel_icx,intel_icpx
- CC
- Sets the language dialect to conform to the indicated C standard.
- -Wl,-z,muldefs
- EXTRA_LDFLAGS
- Enable SmartHeap and/or other library usage by forcing the linker to ignore multiple definitions if present
- -xAVX
- COPTIMIZE, CXXOPTIMIZE, FOPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if the program will be executed on a processor that is not an Intel processor. If you use this option to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error when it is executed on an unsupported processor.
- -Ofast
- COPTIMIZE, CXXOPTIMIZE, FOPTIMIZE
- Enable O3 optimizations plus more aggressive optimizations, such as -ffinite-math-only and -no-prec-div.
- Includes:
  - -O3
    - -O2
      
      -O1
      
      -funroll-loops
      
      -fno-builtin
      
      -mno-ieee-fp
      
      -fomit-framepointer
      
      -ffunction-sections
      
      -ftz
- -ffast-math
- COPTIMIZE, CXXOPTIMIZE, FOPTIMIZE
- Enable fast math mode. This option may yield faster code for programs that do not require the guarantees of exact implementation of IEEE or ISO rules/specifications for math functions.
- -flto
- COPTIMIZE, CXXOPTIMIZE, FOPTIMIZE
- Performs link-time optimization, also known as interprocedural optimization.
- -mfpmath=sse
- COPTIMIZE, CXXOPTIMIZE, FOPTIMIZE
- Generate floating-point arithmetic for the selected unit. Here, use the scalar floating-point instructions present in the SSE instruction set.
- -funroll-loops
- COPTIMIZE, CXXOPTIMIZE, FOPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -funroll-loops0 would disable unrolling of loops.
- -qopt-mem-layout-trans=4
- COPTIMIZE, CXXOPTIMIZE, FOPTIMIZE
- Controls the level of memory layout transformations performed by the compiler. This option can improve cache reuse and cache locality.
  - 0: Disables memory layout transformations. This is the same as specifying -qno-opt-mem-layout-trans
  - 1: Enable basic memory layout transformations like structure splitting, structure peeling, field inlining, field reordering, array field transpose, increase field alignment etc.
  - 2: Enable more memory layout transformations like advanced structure splitting. This is the same as specifying -qopt-mem-layout-trans
  - 3: Enable more memory layout transformations like copy-in/copy-out of structures for a region of code. You should only use this setting if your system has more than 4GB of physical memory per core.
  - 4: Compiler is more aggressive in using memory layout transformations. You should only use this setting if your system has more than 4GB of physical memory per core.
- -Wno-implicit-int
- EXTRA_CFLAGS
- -Wno-implicit-int is needed to allow the compiler to accept invalid C code where the type specifier is missing. With this diagnostic disabled, the missing type will be interpreted as `int`, as in C89 (the last version of C in which implicit type specifiers were allowed).
- -nostandard-realloc-lhs
- EXTRA_FOPTIMIZE
- Option standard-realloc-lhs (the default) tells the compiler that when the left-hand side of an assignment is an allocatable object, it should be reallocated to the shape of the right-hand side of the assignment before the assignment occurs. This is the current Fortran Standard definition. This feature may cause extra overhead at run time. This option has the same effect as option assume realloc_lhs.
  
  If you specify nostandard-realloc-lhs, the compiler uses the old Fortran 2003 rules when interpreting assignment statements. The left-hand side is assumed to be allocated with the correct shape to hold the right-hand side. If it is not, incorrect behavior will occur. This option has the same effect as option assume norealloc_lhs.
- -align array32byte
- EXTRA_FOPTIMIZE
- The align toggle changes how data elements are aligned. Variables and arrays are analyzed and memory layout can be altered. Specifying array32byte will look for opportunities to transform and realign arrays to 32-byte boundaries.
- -auto
- EXTRA_FOPTIMIZE
- Make all local variables AUTOMATIC. Same as -automatic
- -ljemalloc
- EXTRA_LIBS
- Linker toggle to specify jemalloc linker library. See jemalloc.net for more information.
- -L/home/cpu2017/je5.0.1-64/
- EXTRA_LIBS
- Specify build time link path for jemalloc 64bit built to support the CPU 2017 build. See jemalloc.net for more information.

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

Commands and Options Used to Submit Benchmark Runs

Shell, Environment, and Other Software Settings

Red Hat Specific features

Operating System Tuning Parameters

For multi-copy runs or single-copy runs on systems with multiple sockets, it is advantageous to bind a process to a particular core. Otherwise, the OS may arbitrarily move your process from one core to another. This can affect performance. To help, SPEC allows the use of a "submit" command where users can specify a utility to use to bind processes. We have found the utility 'numactl' to be the best choice.

numactl runs processes with a specific NUMA scheduling or memory placement policy. The policy is set for a command and inherited by all of its children. The numactl flag "--physcpubind" specifies which core(s) to bind the process to. "-l" instructs numactl to keep a process's memory on the local node, while "-m" specifies the node(s) on which to place a process's memory. For full details on using numactl, please refer to your Linux documentation, 'man numactl'.

Launching a process with numactl --interleave=all sets the memory interleave policy so that memory is allocated round-robin across nodes. When memory cannot be allocated on the current interleave target, allocation falls back to other nodes.
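As a rough illustration of the binding options described above, the following shell sketch only prints the numactl command lines it would use; it does not run them. The function names bind_cmd and interleave_cmd, the core number, and ./benchmark_exe are hypothetical placeholders and are not part of any SPEC tool.

```shell
# Print (do not execute) a numactl command line that pins one benchmark copy
# to a single core and keeps its memory on the local NUMA node.
# bind_cmd, the core number, and ./benchmark_exe are illustrative placeholders.
bind_cmd() {
    core=$1
    shift
    printf 'numactl --physcpubind=%s -l %s\n' "$core" "$*"
}

# Print a command line that interleaves memory across all nodes instead.
interleave_cmd() {
    printf 'numactl --interleave=all %s\n' "$*"
}

bind_cmd 3 ./benchmark_exe
interleave_cmd ./benchmark_exe
```

Printing the command first makes it easy to check the binding before using it in a real run.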

On Red Hat EL 6 and later, Transparent Hugepages increase the memory page size from 4 kilobytes to 2 megabytes. Transparent Hugepages provide significant performance advantages on systems with highly contended resources and large memory workloads. If memory utilization is too high, or memory is so fragmented that hugepages cannot be allocated, the kernel will assign smaller 4k pages instead. Hugepages are used by default if /sys/kernel/mm/redhat_transparent_hugepage/enabled is set to always.
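One way to check the current Transparent Hugepages mode is sketched below. It assumes the usual kernel convention of marking the active choice with square brackets (for example "[always] madvise never"), and it also tries the mainline path /sys/kernel/mm/transparent_hugepage/enabled, which is an addition not mentioned in the text above. The helper name thp_active_mode is hypothetical.

```shell
# Extract the active mode from a Transparent Hugepages "enabled" line,
# where the kernel marks the current choice with square brackets.
thp_active_mode() {
    printf '%s\n' "$1" | sed 's/.*\[\(.*\)\].*/\1/'
}

# Try the Red Hat path named above first, then the mainline kernel path
# (the second path is an assumption for systems without the Red Hat one).
for f in /sys/kernel/mm/redhat_transparent_hugepage/enabled \
         /sys/kernel/mm/transparent_hugepage/enabled; do
    if [ -r "$f" ]; then
        echo "$f: $(thp_active_mode "$(cat "$f")")"
        break
    fi
done
```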

The Drive Write Cache is an option that can be enabled or disabled in the HP Array Configuration Utility, CLI version. The default value for the Drive Write Cache is Disabled; to change it, the HP Array Configuration Utility, CLI version, needs to be installed. When the Drive Write Cache option is enabled on an HP Smart Array Controller in a system, it can allow the HP Smart Array Controller to help make drive writes more efficient.

The Accelerator Ratio is an option that can be set to different percentages (in 25% increments) in the HP Array Configuration Utility, CLI version. The default value for the Accelerator Ratio is 0% Read and 100% Write. To change it, the HP Array Configuration Utility, CLI version, needs to be installed. Changing the Accelerator Ratio allows the array installed on the HP Smart Array Controller to adjust how it prioritizes reads and writes.

Sets the stack size to n kbytes, or unlimited to allow the stack size to grow without limit.
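The wording above matches the shell built-in "ulimit -s". The setting's name is not shown here, so treating it as "ulimit -s" is an assumption. Under that assumption, a minimal sketch of raising the limit might look like this:

```shell
# Assumption: the setting described above is the shell built-in "ulimit -s".
# Run in a subshell so the limit change does not leak into the calling shell.
# Raising the soft limit fails if the hard limit is lower, so report that case.
(
    ulimit -s unlimited 2>/dev/null || echo "stack limit left unchanged" >&2
    echo "stack size limit: $(ulimit -s)"
)
```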

Sets the number of bytes to allocate for each parallel thread to use as its private stack. Use the optional suffix B, K, M, G, or T, to specify bytes, kilobytes, megabytes, gigabytes, or terabytes. The default setting is 2M on IA32 and 4M on IA64.
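To make the suffix notation concrete, the sketch below converts a value such as 4M into a byte count. It assumes binary multiples (1K = 1024 bytes) and requires an explicit suffix. Both choices belong to this illustration and are not statements about how the OpenMP runtime parses the value. The function name stack_bytes is hypothetical.

```shell
# Convert a size with a B/K/M/G/T suffix into bytes.
# Assumes binary multiples (1K = 1024); a missing suffix is rejected here.
stack_bytes() {
    value=$1
    num=${value%[BKMGT]}
    case $value in
        *B) mult=1 ;;
        *K) mult=1024 ;;
        *M) mult=$((1024 * 1024)) ;;
        *G) mult=$((1024 * 1024 * 1024)) ;;
        *T) mult=$((1024 * 1024 * 1024 * 1024)) ;;
        *)  echo "missing unit suffix: $value" >&2; return 1 ;;
    esac
    echo $((num * mult))
}

stack_bytes 2M
stack_bytes 4M
```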

Assigns threads to consecutive physical processors (for example, cores), beginning at processor n. Specifies the static mapping of user threads to physical cores, beginning at processor n. For example, if a system is configured with 8 cores, and OMP_NUM_THREADS=8 and KMP_AFFINITY=physical,2 are set, then thread 0 will be mapped to core 2, thread 1 will be mapped to core 3, and so on in a round-robin fashion.
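The round-robin mapping in the example above can be written out as simple arithmetic: thread t lands on core (n + t) modulo the number of cores. The sketch below only illustrates that arithmetic for the 8-core case; it does not reproduce how the OpenMP runtime implements the mapping, and the function name map_threads is hypothetical.

```shell
# Print the thread-to-core mapping implied by KMP_AFFINITY=physical,<start>
# for the example in the text: core = (start + thread) mod ncores.
map_threads() {
    start=$1
    ncores=$2
    nthreads=$3
    t=0
    while [ "$t" -lt "$nthreads" ]; do
        echo "thread $t -> core $(( (start + t) % ncores ))"
        t=$((t + 1))
    done
}

# 8 cores, OMP_NUM_THREADS=8, KMP_AFFINITY=physical,2
map_threads 2 8 8
```

With these inputs, threads 0 through 5 map to cores 2 through 7, and threads 6 and 7 wrap around to cores 0 and 1.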

This environment variable sets the maximum number of threads to use for OpenMP* parallel regions to n if no other value is specified in the application. This environment variable applies to both -openmp and -parallel (Linux) or /Qopenmp and /Qparallel (Windows). Example syntax on a Linux system with 8 cores:
export OMP_NUM_THREADS=8
Default is the number of cores visible to the OS.

The maximum number of memory map areas a process may have. Memory map areas are used as a side-effect of calling malloc, directly by mmap and mprotect, and also when loading shared libraries.
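This description matches the Linux kernel parameter vm.max_map_count. The parameter name is not shown here, so that identification is an assumption. Under that assumption, the current value could be inspected as follows. The function name max_map_count is hypothetical.

```shell
# Report the current limit if the kernel exposes it (Linux only).
# The /proc path is an assumption about which setting is being described.
max_map_count() {
    if [ -r /proc/sys/vm/max_map_count ]; then
        cat /proc/sys/vm/max_map_count
    else
        echo "unavailable"
    fi
}

max_map_count
# Changing the value requires root; <n> is a placeholder, not a recommendation:
#   sysctl -w vm.max_map_count=<n>
```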

The following unused Linux services were disabled before the run in a simple shell script via the command "service {name} stop": abrt-ccpp, abrt-oops, abrtd, acpid, atd, auditd, autofs, avahi-daemon, cgconfig, cpuspeed, crond, cups, haldaemon, irqbalance, kdump, libvirt-guests, mcelogd, mdmonitor, messagebus, portreserve, postfix, rhnsd, rhsmcertd, rpcbind, rpcgssd, rpcidmapd, certmonger, lvm2-monitor, netfs, and sysstat.
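The tester's actual script is not shown, but a loop of roughly the following shape would do what is described. The DRY_RUN switch is an addition for this sketch: it defaults to printing the commands so that nothing is stopped by accident.

```shell
# Stop each listed service with "service {name} stop".
# DRY_RUN (an addition for this sketch) defaults to 1, so commands are only printed.
SERVICES="abrt-ccpp abrt-oops abrtd acpid atd auditd autofs avahi-daemon
cgconfig cpuspeed crond cups haldaemon irqbalance kdump libvirt-guests
mcelogd mdmonitor messagebus portreserve postfix rhnsd rhsmcertd rpcbind
rpcgssd rpcidmapd certmonger lvm2-monitor netfs sysstat"

stop_services() {
    for svc in $SERVICES; do
        if [ "${DRY_RUN:-1}" = "1" ]; then
            echo "service $svc stop"
        else
            service "$svc" stop
        fi
    done
}

stop_services
```

Running with DRY_RUN=0 as root would issue the stop commands for real.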

Firmware / BIOS / Microcode Settings

One or more of the following settings may have been set. If so, the "Platform Notes" section of the report will say so; and you can read below to find out more about what these settings mean.

This feature allows enabling/disabling of logical processor cores on processors supporting Intel's Hyper-Threading Technology. This option may improve overall performance for applications that will benefit from higher processor core count.

Processor Core Disable (Intel Core Select) (Default = number of physical cores/processor):

This feature allows disabling of processor cores using Intel's Core Multi-Processing (CMP) Technology. This option allows disabling of a specific number of the cores on each physical processor. This option has the following potential uses: Reduce processor power usage and potentially improve performance/watt with some applications; improve overall performance for applications that will benefit from higher performance cores rather than more processing cores; address issues with software that is licensed on a per-core basis.

The value entered should be the number of enabled cores per socket. Valid values are 1 to 12 where 1 indicates that one core will be ENABLED per processor socket. A value of 0 is invalid as the minimum number of enabled cores per processor socket is 1.

Power Regulator for ProLiant support (Default=HP Dynamic Power Savings Mode)

Minimum Processor Idle Power Core State (Default (w/HP Power Profile=Maximum Performance)=No C-states):

This feature selects the processor's lowest idle core power state (C-state) which the operating system will utilize. The higher the C-State, the lower the power usage of that idle state (Core C6 is the lowest power idle core state supported by the processor). Values for this setting can be:

Minimum Processor Idle Power Package State (Default (w/HP Power Profile=Maximum Performance)=No Package state):

This feature selects the processor's lowest idle package power state (C-state) which is enabled. The processor will automatically transition into the package C-states based on the Core C-states which cores on the processor have transitioned to. The higher the package C-state, the lower the power usage of that idle package state (Package C6 (retention) is the lowest power idle package state supported by the processor). Values for this setting can be:

This option configures several processor subsystems to optimize the processor's performance and power usage. Values for this BIOS setting can be:

This BIOS option allows the enabling/disabling of the Processor Clocking Control (PCC) Interface, for operating systems which support this feature. Enabling this option allows the Operating System to request processor frequency changes even when the server has the Power Regulator option configured for Dynamic Power Savings Mode.

For Operating Systems that do not support the PCC Interface or when the Power Regulator Mode is not configured for Dynamic Power Savings Mode, this option has no impact on system operation.

This BIOS option allows the user to disable the System ROM Power Calibration feature that is executed during the boot process. When disabled, the user can expect faster boot times but will not be able to enable a Dynamic Power Cap until this feature is re-enabled.

This option configures several memory parameters to optimize the memory subsystem's performance and power usage. Values for this BIOS setting can be:

This feature allows the user to select the fan cooling solution for the system. Values for this BIOS option can be:

This BIOS option allows the enabling/disabling of a processor mechanism to prefetch data into the cache according to a pattern recognition algorithm.

In some limited cases, setting this option to Disabled may improve performance. In the majority of cases, the default value of Enabled provides better performance. Users should only disable this option after performing application benchmarking to verify improved performance in their environment.

This BIOS option allows the enabling/disabling of a processor mechanism to fetch the adjacent cache line within a 128-byte sector that contains the data needed due to a cache line miss.
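To show which line counts as "adjacent", the sketch below computes the address of the other cache line in the same 128-byte sector. It assumes 64-byte cache lines and 128-byte-aligned sectors. Both the function name adjacent_line and the sample address are hypothetical, and the arithmetic illustrates the address layout only, not the processor's prefetch logic.

```shell
# Given a byte address, print the start of the other 64-byte line in the same
# 128-byte-aligned sector (assumes 64-byte cache lines).
adjacent_line() {
    line=$(( $1 & ~63 ))      # start of the 64-byte line containing the address
    buddy=$(( line ^ 64 ))    # flip bit 6 to reach the other line in the sector
    printf '0x%x\n' "$buddy"
}

adjacent_line 0x1234
```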

This BIOS option allows the enabling/disabling of iLo4 Processor State Mode Switching and Insight Power Management Processor Utilization Monitoring.

When set to disabled, the system will also set the HP Power Regulator mode to HP Static High Performance mode and the HP Power Profile mode to Custom. This option may be useful in some environments that require absolute minimum latency.

This BIOS option controls the refresh rate of the memory controller and may affect the performance and resiliency of the server's memory.

When set to 1x Refresh, the memory refresh rate will be decreased, the HP Power Regulator mode will be set to HP Static High Performance mode, and the HP Power Profile mode to Custom. This option may be useful in some environments that require absolute minimum latency.

When set to 3x Refresh, the memory refresh rate will be increased, the HP Power Regulator mode will be set to HP Static High Performance mode, and the HP Power Profile mode to Custom.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact info@spec.org
Copyright 2017-2023 Standard Performance Evaluation Corporation
Tested with SPEC CPU2017 v1.1.9.
Report generated on 2023-06-06 19:14:19 by SPEC CPU2017 flags formatter v5178.

CPU2017 Flag Description
Hewlett Packard Enterprise ProLiant DL380p Gen8 (2.50 GHz, Intel Xeon E5-2670 v2)

Test sponsored by HPE


