OMP2012 Result Flag Description

Base Optimization Flags

C benchmarks

- -Ofast
- COPTIMIZE, OPTIMIZE
- Sets certain aggressive options to improve the speed of your application.
- -fopenmp
- COPTIMIZE, OPTIMIZE
- Enables recognition of OpenMP* features and tells the parallelizer to generate multi-threaded code based on OpenMP* directives.
- -march=core-avx2
- COPTIMIZE, OPTIMIZE
- Tells the compiler to generate code for processors that support certain features. May generate instructions for processors that support the specified Intel® processor or microarchitecture code name. Keywords knl and silvermont are only available on Linux* systems. This content is specific to C++; it does not apply to DPC++. Keyword icelake is deprecated and may be removed in a future release. Indicates to the compiler the code it may generate. Possible values are:
  
  amberlake
  
  broadwell
  
  cannonlake
  
  cascadelake
  
  coffeelake
  
  goldmont
  
  goldmont-plus
  
  haswell
  
  icelake-client (or icelake)
  
  icelake-server
  
  ivybridge
  
  kabylake
  
  knl
  
  knm
  
  sandybridge
  
  silvermont
  
  skylake
  
  skylake-avx512
  
  tremont
  
  whiskeylake
  
  core-avx2 - Generates code for processors that support Intel® Advanced Vector Extensions 2 (Intel® AVX2), Intel® AVX, SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  core-avx-i - Generates code for processors that support Float-16 conversion instructions and the RDRND instruction, Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7-avx - Generates code for processors that support Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7 - Generates code for processors that support Intel® SSE4 Efficient Accelerated String and Text Processing instructions. May also generate code for Intel® SSE4 Vectorizing Compiler and Media Accelerator, Intel® SSE3, SSE2, SSE, and SSSE3 instructions.
  
  atom - Generates code for processors that support MOVBE instructions. May also generate code for SSSE3 instructions and Intel® SSE3, SSE2, and SSE instructions.
  
  core2 - Generates code for the Intel® Core™2 processor family.
  
  pentium4m - Generates for Intel® Pentium® 4 processors with MMX technology.
  
  pentium-m - Generates code for Intel® Pentium® processors. Value pentium3 is only available on Linux* systems.
  
  pentium4
  
  pentium3
  
  pentium
- -fma
- COPTIMIZE, OPTIMIZE
- Determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. This option determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. When the [Q]fma option is specified, the compiler may generate FMA instructions for combining multiply and add operations. When the negative form of the [Q]fma option is specified, the compiler must generate separate multiply and add instructions with intermediate rounding. This option has no effect unless setting CORE-AVX2 or higher is specified for option [Q]x,-march (Linux and macOS*), or /arch (Windows).
- -ipo
- COPTIMIZE, OPTIMIZE
- Enables interprocedural optimization between files. Arguments: n Is an optional integer that specifies the number of object files the compiler should create. The integer must be greater than or equal to 0.
  -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -ansi-alias
- COPTIMIZE, OPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.
- -fp-model fast=2
- COPTIMIZE, OPTIMIZE
- enable floating point model variation
  [no-]except - enable/disable floating point semantics
  fast[=1|2] - enables more aggressive floating point optimizations
  precise - allows value-safe optimizations
  source - enables intermediates in source precision
  strict - enables -fp-model precise -fp-model except, disables
  contractions and enables pragma stdc fenv_access
  double - rounds intermediates in 53-bit (double) precision
  extended - rounds intermediates in 64-bit (extended) precision
- -qno-opt-multiple-gather-scatter-by-shuffles
- COPTIMIZE, OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references. This content is specific to C++; it does not apply to DPC++. This option controls the optimization for multiple adjacent gather/scatter type vector memory references. This optimization hint is useful for performance tuning. It tries to generate more optimal software sequences using shuffles. If you specify this option, the compiler will apply the optimization heuristics. If you specify -qno-opt-multiple-gather-scatter-by-shuffles or /Qopt-multiple-gather-scatter-by-shuffles-, the compiler will not apply the optimization.
- -qopt-zmm-usage=high
- COPTIMIZE, OPTIMIZE
- Defines a level of zmm registers usage.
  -qopt-zmm-usage=keywoard Specifies the level of zmm register usage. You can specify one of the following:
  
  low - Tells the compiler that the compiled program is unlikely to benefit from zmm register usage. It specifies that the compiler should avoid using zmm register unless it can prove the gain from their usage.
  
  high - Tells the compiler to generate zmm code without restrictions
- -ffast-math
- COPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fstrict-enums
- COPTIMIZE
- Enable optimizations based on the strict definition of an enum's value range.
- -fstrict-vtable-pointers
- COPTIMIZE
- Enable optimizations based on the strict rules for overwriting polymorphic C++ objects.
- -fvirtual-function-elimination
- COPTIMIZE
- Enables dead virtual function elimination optimization. Requires -flto=full.

C++ benchmarks

- -Ofast
- CXXOPTIMIZE, OPTIMIZE
- Sets certain aggressive options to improve the speed of your application.
- -fopenmp
- CXXOPTIMIZE, OPTIMIZE
- Enables recognition of OpenMP* features and tells the parallelizer to generate multi-threaded code based on OpenMP* directives.
- -march=core-avx2
- CXXOPTIMIZE, OPTIMIZE
- Tells the compiler to generate code for processors that support certain features. May generate instructions for processors that support the specified Intel® processor or microarchitecture code name. Keywords knl and silvermont are only available on Linux* systems. This content is specific to C++; it does not apply to DPC++. Keyword icelake is deprecated and may be removed in a future release. Indicates to the compiler the code it may generate. Possible values are:
  
  amberlake
  
  broadwell
  
  cannonlake
  
  cascadelake
  
  coffeelake
  
  goldmont
  
  goldmont-plus
  
  haswell
  
  icelake-client (or icelake)
  
  icelake-server
  
  ivybridge
  
  kabylake
  
  knl
  
  knm
  
  sandybridge
  
  silvermont
  
  skylake
  
  skylake-avx512
  
  tremont
  
  whiskeylake
  
  core-avx2 - Generates code for processors that support Intel® Advanced Vector Extensions 2 (Intel® AVX2), Intel® AVX, SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  core-avx-i - Generates code for processors that support Float-16 conversion instructions and the RDRND instruction, Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7-avx - Generates code for processors that support Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7 - Generates code for processors that support Intel® SSE4 Efficient Accelerated String and Text Processing instructions. May also generate code for Intel® SSE4 Vectorizing Compiler and Media Accelerator, Intel® SSE3, SSE2, SSE, and SSSE3 instructions.
  
  atom - Generates code for processors that support MOVBE instructions. May also generate code for SSSE3 instructions and Intel® SSE3, SSE2, and SSE instructions.
  
  core2 - Generates code for the Intel® Core™2 processor family.
  
  pentium4m - Generates for Intel® Pentium® 4 processors with MMX technology.
  
  pentium-m - Generates code for Intel® Pentium® processors. Value pentium3 is only available on Linux* systems.
  
  pentium4
  
  pentium3
  
  pentium
- -fma
- CXXOPTIMIZE, OPTIMIZE
- Determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. This option determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. When the [Q]fma option is specified, the compiler may generate FMA instructions for combining multiply and add operations. When the negative form of the [Q]fma option is specified, the compiler must generate separate multiply and add instructions with intermediate rounding. This option has no effect unless setting CORE-AVX2 or higher is specified for option [Q]x,-march (Linux and macOS*), or /arch (Windows).
- -ipo
- CXXOPTIMIZE, OPTIMIZE
- Enables interprocedural optimization between files. Arguments: n Is an optional integer that specifies the number of object files the compiler should create. The integer must be greater than or equal to 0.
  -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -ansi-alias
- CXXOPTIMIZE, OPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.
- -fp-model fast=2
- CXXOPTIMIZE, OPTIMIZE
- enable floating point model variation
  [no-]except - enable/disable floating point semantics
  fast[=1|2] - enables more aggressive floating point optimizations
  precise - allows value-safe optimizations
  source - enables intermediates in source precision
  strict - enables -fp-model precise -fp-model except, disables
  contractions and enables pragma stdc fenv_access
  double - rounds intermediates in 53-bit (double) precision
  extended - rounds intermediates in 64-bit (extended) precision
- -qno-opt-multiple-gather-scatter-by-shuffles
- CXXOPTIMIZE, OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references. This content is specific to C++; it does not apply to DPC++. This option controls the optimization for multiple adjacent gather/scatter type vector memory references. This optimization hint is useful for performance tuning. It tries to generate more optimal software sequences using shuffles. If you specify this option, the compiler will apply the optimization heuristics. If you specify -qno-opt-multiple-gather-scatter-by-shuffles or /Qopt-multiple-gather-scatter-by-shuffles-, the compiler will not apply the optimization.
- -qopt-zmm-usage=high
- CXXOPTIMIZE, OPTIMIZE
- Defines a level of zmm registers usage.
  -qopt-zmm-usage=keywoard Specifies the level of zmm register usage. You can specify one of the following:
  
  low - Tells the compiler that the compiled program is unlikely to benefit from zmm register usage. It specifies that the compiler should avoid using zmm register unless it can prove the gain from their usage.
  
  high - Tells the compiler to generate zmm code without restrictions
- -ffast-math
- CXXOPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fstrict-enums
- CXXOPTIMIZE
- Enable optimizations based on the strict definition of an enum's value range.
- -fstrict-vtable-pointers
- CXXOPTIMIZE
- Enable optimizations based on the strict rules for overwriting polymorphic C++ objects.

Fortran benchmarks

- -Ofast
- FOPTIMIZE, OPTIMIZE
- Sets certain aggressive options to improve the speed of your application.
- -fopenmp
- FOPTIMIZE, OPTIMIZE
- Enables recognition of OpenMP* features and tells the parallelizer to generate multi-threaded code based on OpenMP* directives.
- -march=core-avx2
- FOPTIMIZE, OPTIMIZE
- Tells the compiler to generate code for processors that support certain features. May generate instructions for processors that support the specified Intel® processor or microarchitecture code name. Keywords knl and silvermont are only available on Linux* systems. This content is specific to C++; it does not apply to DPC++. Keyword icelake is deprecated and may be removed in a future release. Indicates to the compiler the code it may generate. Possible values are:
  
  amberlake
  
  broadwell
  
  cannonlake
  
  cascadelake
  
  coffeelake
  
  goldmont
  
  goldmont-plus
  
  haswell
  
  icelake-client (or icelake)
  
  icelake-server
  
  ivybridge
  
  kabylake
  
  knl
  
  knm
  
  sandybridge
  
  silvermont
  
  skylake
  
  skylake-avx512
  
  tremont
  
  whiskeylake
  
  core-avx2 - Generates code for processors that support Intel® Advanced Vector Extensions 2 (Intel® AVX2), Intel® AVX, SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  core-avx-i - Generates code for processors that support Float-16 conversion instructions and the RDRND instruction, Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7-avx - Generates code for processors that support Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7 - Generates code for processors that support Intel® SSE4 Efficient Accelerated String and Text Processing instructions. May also generate code for Intel® SSE4 Vectorizing Compiler and Media Accelerator, Intel® SSE3, SSE2, SSE, and SSSE3 instructions.
  
  atom - Generates code for processors that support MOVBE instructions. May also generate code for SSSE3 instructions and Intel® SSE3, SSE2, and SSE instructions.
  
  core2 - Generates code for the Intel® Core™2 processor family.
  
  pentium4m - Generates for Intel® Pentium® 4 processors with MMX technology.
  
  pentium-m - Generates code for Intel® Pentium® processors. Value pentium3 is only available on Linux* systems.
  
  pentium4
  
  pentium3
  
  pentium
- -fma
- FOPTIMIZE, OPTIMIZE
- Determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. This option determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. When the [Q]fma option is specified, the compiler may generate FMA instructions for combining multiply and add operations. When the negative form of the [Q]fma option is specified, the compiler must generate separate multiply and add instructions with intermediate rounding. This option has no effect unless setting CORE-AVX2 or higher is specified for option [Q]x,-march (Linux and macOS*), or /arch (Windows).
- -ipo
- FOPTIMIZE, OPTIMIZE
- Enables interprocedural optimization between files. Arguments: n Is an optional integer that specifies the number of object files the compiler should create. The integer must be greater than or equal to 0.
  -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -ansi-alias
- FOPTIMIZE, OPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.
- -fp-model fast=2
- FOPTIMIZE, OPTIMIZE
- enable floating point model variation
  [no-]except - enable/disable floating point semantics
  fast[=1|2] - enables more aggressive floating point optimizations
  precise - allows value-safe optimizations
  source - enables intermediates in source precision
  strict - enables -fp-model precise -fp-model except, disables
  contractions and enables pragma stdc fenv_access
  double - rounds intermediates in 53-bit (double) precision
  extended - rounds intermediates in 64-bit (extended) precision
- -qno-opt-multiple-gather-scatter-by-shuffles
- FOPTIMIZE, OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references. This content is specific to C++; it does not apply to DPC++. This option controls the optimization for multiple adjacent gather/scatter type vector memory references. This optimization hint is useful for performance tuning. It tries to generate more optimal software sequences using shuffles. If you specify this option, the compiler will apply the optimization heuristics. If you specify -qno-opt-multiple-gather-scatter-by-shuffles or /Qopt-multiple-gather-scatter-by-shuffles-, the compiler will not apply the optimization.
- -qopt-zmm-usage=high
- FOPTIMIZE, OPTIMIZE
- Defines a level of zmm registers usage.
  -qopt-zmm-usage=keywoard Specifies the level of zmm register usage. You can specify one of the following:
  
  low - Tells the compiler that the compiled program is unlikely to benefit from zmm register usage. It specifies that the compiler should avoid using zmm register unless it can prove the gain from their usage.
  
  high - Tells the compiler to generate zmm code without restrictions
- -align array128byte
- FOPTIMIZE
- specify how data items are aligned
  keywords: all (same as -align), none (same as -noalign),
  [no]commons, [no]dcommons,
  [no]qcommons, [no]zcommons,
  rec1byte, rec2byte, rec4byte,
  rec8byte, rec16byte, rec32byte,
  array8byte, array16byte, array32byte,
  array64byte, array128byte, array256byte,
  [no]records, [no]sequence
- -ffinite-math-only
- FOPTIMIZE
- Allow optimizations for floating point arithmetic that assume arguments and results are not NaNs or Infinities.
- -fno-omit-frame-pointer
- FOPTIMIZE
- Determines whether EBP is used as a general-purpose register in optimizations.
- -m64
- FOPTIMIZE
- Tells the compiler to generate code for Intel® 64 architecture.
- -ipo1
- FOPTIMIZE
- Enables interprocedural optimization between files. Arguments: n Is an optional integer that specifies the number of object files the compiler should create. The integer must be greater than or equal to 0.
  -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -foptimize-sibling-calls
- FOPTIMIZE
- Determines whether the compiler optimizes tail recursive calls. This feature is only available for ifort. This option determines whether the compiler optimizes tail recursive calls. It enables conversion of tail recursion into loops.
- -vec
- FOPTIMIZE
- Enables or disables vectorization. To disable vectorization, specify -no-vec (Linux* and macOS) or /Qvec- (Windows*). To disable interpretation of SIMD directives, specify -no-simd (Linux* and macOS) or /Qsimd- (Windows*). To disable all compiler vectorization, use the "-no-vec -no-simd" (Linux* and macOS) or "/Qvec- /Qsimd-" (Windows*) compiler options. The option -no-vec (and /Qvec-) disables all auto-vectorization, including vectorization of array notation statements. The option -no-simd (and /Qsimd-) disables vectorization of loops that have SIMD directives.

Peak Optimization Flags

C benchmarks

- -Ofast
- COPTIMIZE, OPTIMIZE
- Sets certain aggressive options to improve the speed of your application.
- -fopenmp
- COPTIMIZE, OPTIMIZE
- Enables recognition of OpenMP* features and tells the parallelizer to generate multi-threaded code based on OpenMP* directives.
- -march=core-avx2
- COPTIMIZE, OPTIMIZE
- Tells the compiler to generate code for processors that support certain features. May generate instructions for processors that support the specified Intel® processor or microarchitecture code name. Keywords knl and silvermont are only available on Linux* systems. This content is specific to C++; it does not apply to DPC++. Keyword icelake is deprecated and may be removed in a future release. Indicates to the compiler the code it may generate. Possible values are:
  
  amberlake
  
  broadwell
  
  cannonlake
  
  cascadelake
  
  coffeelake
  
  goldmont
  
  goldmont-plus
  
  haswell
  
  icelake-client (or icelake)
  
  icelake-server
  
  ivybridge
  
  kabylake
  
  knl
  
  knm
  
  sandybridge
  
  silvermont
  
  skylake
  
  skylake-avx512
  
  tremont
  
  whiskeylake
  
  core-avx2 - Generates code for processors that support Intel® Advanced Vector Extensions 2 (Intel® AVX2), Intel® AVX, SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  core-avx-i - Generates code for processors that support Float-16 conversion instructions and the RDRND instruction, Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7-avx - Generates code for processors that support Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7 - Generates code for processors that support Intel® SSE4 Efficient Accelerated String and Text Processing instructions. May also generate code for Intel® SSE4 Vectorizing Compiler and Media Accelerator, Intel® SSE3, SSE2, SSE, and SSSE3 instructions.
  
  atom - Generates code for processors that support MOVBE instructions. May also generate code for SSSE3 instructions and Intel® SSE3, SSE2, and SSE instructions.
  
  core2 - Generates code for the Intel® Core™2 processor family.
  
  pentium4m - Generates for Intel® Pentium® 4 processors with MMX technology.
  
  pentium-m - Generates code for Intel® Pentium® processors. Value pentium3 is only available on Linux* systems.
  
  pentium4
  
  pentium3
  
  pentium
- -fma
- COPTIMIZE, OPTIMIZE
- Determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. This option determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. When the [Q]fma option is specified, the compiler may generate FMA instructions for combining multiply and add operations. When the negative form of the [Q]fma option is specified, the compiler must generate separate multiply and add instructions with intermediate rounding. This option has no effect unless setting CORE-AVX2 or higher is specified for option [Q]x,-march (Linux and macOS*), or /arch (Windows).
- -ipo
- COPTIMIZE, OPTIMIZE
- Enables interprocedural optimization between files. Arguments: n Is an optional integer that specifies the number of object files the compiler should create. The integer must be greater than or equal to 0.
  -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -ansi-alias
- COPTIMIZE, OPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.
- -fp-model fast=2
- COPTIMIZE, OPTIMIZE
- enable floating point model variation
  [no-]except - enable/disable floating point semantics
  fast[=1|2] - enables more aggressive floating point optimizations
  precise - allows value-safe optimizations
  source - enables intermediates in source precision
  strict - enables -fp-model precise -fp-model except, disables
  contractions and enables pragma stdc fenv_access
  double - rounds intermediates in 53-bit (double) precision
  extended - rounds intermediates in 64-bit (extended) precision
- -qno-opt-multiple-gather-scatter-by-shuffles
- COPTIMIZE, OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references. This content is specific to C++; it does not apply to DPC++. This option controls the optimization for multiple adjacent gather/scatter type vector memory references. This optimization hint is useful for performance tuning. It tries to generate more optimal software sequences using shuffles. If you specify this option, the compiler will apply the optimization heuristics. If you specify -qno-opt-multiple-gather-scatter-by-shuffles or /Qopt-multiple-gather-scatter-by-shuffles-, the compiler will not apply the optimization.
- -qopt-zmm-usage=high
- COPTIMIZE, OPTIMIZE
- Defines a level of zmm registers usage.
  -qopt-zmm-usage=keywoard Specifies the level of zmm register usage. You can specify one of the following:
  
  low - Tells the compiler that the compiled program is unlikely to benefit from zmm register usage. It specifies that the compiler should avoid using zmm register unless it can prove the gain from their usage.
  
  high - Tells the compiler to generate zmm code without restrictions
- -ffast-math
- COPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fstrict-enums
- COPTIMIZE
- Enable optimizations based on the strict definition of an enum's value range.
- -fstrict-vtable-pointers
- COPTIMIZE
- Enable optimizations based on the strict rules for overwriting polymorphic C++ objects.
- -fvirtual-function-elimination
- COPTIMIZE
- Enables dead virtual function elimination optimization. Requires -flto=full.

C++ benchmarks

- -Ofast
- CXXOPTIMIZE, OPTIMIZE
- Sets certain aggressive options to improve the speed of your application.
- -fopenmp
- CXXOPTIMIZE, OPTIMIZE
- Enables recognition of OpenMP* features and tells the parallelizer to generate multi-threaded code based on OpenMP* directives.
- -march=core-avx2
- CXXOPTIMIZE, OPTIMIZE
- Tells the compiler to generate code for processors that support certain features. May generate instructions for processors that support the specified Intel® processor or microarchitecture code name. Keywords knl and silvermont are only available on Linux* systems. This content is specific to C++; it does not apply to DPC++. Keyword icelake is deprecated and may be removed in a future release. Indicates to the compiler the code it may generate. Possible values are:
  
  amberlake
  
  broadwell
  
  cannonlake
  
  cascadelake
  
  coffeelake
  
  goldmont
  
  goldmont-plus
  
  haswell
  
  icelake-client (or icelake)
  
  icelake-server
  
  ivybridge
  
  kabylake
  
  knl
  
  knm
  
  sandybridge
  
  silvermont
  
  skylake
  
  skylake-avx512
  
  tremont
  
  whiskeylake
  
  core-avx2 - Generates code for processors that support Intel® Advanced Vector Extensions 2 (Intel® AVX2), Intel® AVX, SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  core-avx-i - Generates code for processors that support Float-16 conversion instructions and the RDRND instruction, Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7-avx - Generates code for processors that support Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7 - Generates code for processors that support Intel® SSE4 Efficient Accelerated String and Text Processing instructions. May also generate code for Intel® SSE4 Vectorizing Compiler and Media Accelerator, Intel® SSE3, SSE2, SSE, and SSSE3 instructions.
  
  atom - Generates code for processors that support MOVBE instructions. May also generate code for SSSE3 instructions and Intel® SSE3, SSE2, and SSE instructions.
  
  core2 - Generates code for the Intel® Core™2 processor family.
  
  pentium4m - Generates for Intel® Pentium® 4 processors with MMX technology.
  
  pentium-m - Generates code for Intel® Pentium® processors. Value pentium3 is only available on Linux* systems.
  
  pentium4
  
  pentium3
  
  pentium
- -fma
- CXXOPTIMIZE, OPTIMIZE
- Determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. This option determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. When the [Q]fma option is specified, the compiler may generate FMA instructions for combining multiply and add operations. When the negative form of the [Q]fma option is specified, the compiler must generate separate multiply and add instructions with intermediate rounding. This option has no effect unless setting CORE-AVX2 or higher is specified for option [Q]x,-march (Linux and macOS*), or /arch (Windows).
- -ipo
- CXXOPTIMIZE, OPTIMIZE
- Enables interprocedural optimization between files. Arguments: n Is an optional integer that specifies the number of object files the compiler should create. The integer must be greater than or equal to 0.
  -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -ansi-alias
- CXXOPTIMIZE, OPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.
- -fp-model fast=2
- CXXOPTIMIZE, OPTIMIZE
- enable floating point model variation
  [no-]except - enable/disable floating point semantics
  fast[=1|2] - enables more aggressive floating point optimizations
  precise - allows value-safe optimizations
  source - enables intermediates in source precision
  strict - enables -fp-model precise -fp-model except, disables
  contractions and enables pragma stdc fenv_access
  double - rounds intermediates in 53-bit (double) precision
  extended - rounds intermediates in 64-bit (extended) precision
- -qno-opt-multiple-gather-scatter-by-shuffles
- CXXOPTIMIZE, OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references. This content is specific to C++; it does not apply to DPC++. This option controls the optimization for multiple adjacent gather/scatter type vector memory references. This optimization hint is useful for performance tuning. It tries to generate more optimal software sequences using shuffles. If you specify this option, the compiler will apply the optimization heuristics. If you specify -qno-opt-multiple-gather-scatter-by-shuffles or /Qopt-multiple-gather-scatter-by-shuffles-, the compiler will not apply the optimization.
- -qopt-zmm-usage=high
- CXXOPTIMIZE, OPTIMIZE
- Defines a level of zmm registers usage.
  -qopt-zmm-usage=keywoard Specifies the level of zmm register usage. You can specify one of the following:
  
  low - Tells the compiler that the compiled program is unlikely to benefit from zmm register usage. It specifies that the compiler should avoid using zmm register unless it can prove the gain from their usage.
  
  high - Tells the compiler to generate zmm code without restrictions
- -ffast-math
- CXXOPTIMIZE
- Allow aggressive, lossy floating-point optimizations.
- -fstrict-enums
- CXXOPTIMIZE
- Enable optimizations based on the strict definition of an enum's value range.
- -fstrict-vtable-pointers
- CXXOPTIMIZE
- Enable optimizations based on the strict rules for overwriting polymorphic C++ objects.

Fortran benchmarks

- -Ofast
- FOPTIMIZE, OPTIMIZE
- Sets certain aggressive options to improve the speed of your application.
- -fopenmp
- FOPTIMIZE, OPTIMIZE
- Enables recognition of OpenMP* features and tells the parallelizer to generate multi-threaded code based on OpenMP* directives.
- -march=core-avx2
- FOPTIMIZE, OPTIMIZE
- Tells the compiler to generate code for processors that support certain features. May generate instructions for processors that support the specified Intel® processor or microarchitecture code name. Keywords knl and silvermont are only available on Linux* systems. This content is specific to C++; it does not apply to DPC++. Keyword icelake is deprecated and may be removed in a future release. Indicates to the compiler the code it may generate. Possible values are:
  
  amberlake
  
  broadwell
  
  cannonlake
  
  cascadelake
  
  coffeelake
  
  goldmont
  
  goldmont-plus
  
  haswell
  
  icelake-client (or icelake)
  
  icelake-server
  
  ivybridge
  
  kabylake
  
  knl
  
  knm
  
  sandybridge
  
  silvermont
  
  skylake
  
  skylake-avx512
  
  tremont
  
  whiskeylake
  
  core-avx2 - Generates code for processors that support Intel® Advanced Vector Extensions 2 (Intel® AVX2), Intel® AVX, SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  core-avx-i - Generates code for processors that support Float-16 conversion instructions and the RDRND instruction, Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7-avx - Generates code for processors that support Intel® Advanced Vector Extensions (Intel® AVX), Intel® SSE4.2, SSE4.1, SSE3, SSE2, SSE, and SSSE3 instructions.
  
  corei7 - Generates code for processors that support Intel® SSE4 Efficient Accelerated String and Text Processing instructions. May also generate code for Intel® SSE4 Vectorizing Compiler and Media Accelerator, Intel® SSE3, SSE2, SSE, and SSSE3 instructions.
  
  atom - Generates code for processors that support MOVBE instructions. May also generate code for SSSE3 instructions and Intel® SSE3, SSE2, and SSE instructions.
  
  core2 - Generates code for the Intel® Core™2 processor family.
  
  pentium4m - Generates for Intel® Pentium® 4 processors with MMX technology.
  
  pentium-m - Generates code for Intel® Pentium® processors. Value pentium3 is only available on Linux* systems.
  
  pentium4
  
  pentium3
  
  pentium
- -fma
- FOPTIMIZE, OPTIMIZE
- Determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. This option determines whether the compiler generates fused multiply-add (FMA) instructions if such instructions exist on the target processor. When the [Q]fma option is specified, the compiler may generate FMA instructions for combining multiply and add operations. When the negative form of the [Q]fma option is specified, the compiler must generate separate multiply and add instructions with intermediate rounding. This option has no effect unless setting CORE-AVX2 or higher is specified for option [Q]x,-march (Linux and macOS*), or /arch (Windows).
- -ipo
- FOPTIMIZE, OPTIMIZE
- Enables interprocedural optimization between files. Arguments: n Is an optional integer that specifies the number of object files the compiler should create. The integer must be greater than or equal to 0.
  -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -ansi-alias
- FOPTIMIZE, OPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.
- -fp-model fast=2
- FOPTIMIZE, OPTIMIZE
- enable floating point model variation
  [no-]except - enable/disable floating point semantics
  fast[=1|2] - enables more aggressive floating point optimizations
  precise - allows value-safe optimizations
  source - enables intermediates in source precision
  strict - enables -fp-model precise -fp-model except, disables
  contractions and enables pragma stdc fenv_access
  double - rounds intermediates in 53-bit (double) precision
  extended - rounds intermediates in 64-bit (extended) precision
- -qno-opt-multiple-gather-scatter-by-shuffles
- FOPTIMIZE, OPTIMIZE
- Enables or disables the optimization for multiple adjacent gather/scatter type vector memory references. This content is specific to C++; it does not apply to DPC++. This option controls the optimization for multiple adjacent gather/scatter type vector memory references. This optimization hint is useful for performance tuning. It tries to generate more optimal software sequences using shuffles. If you specify this option, the compiler will apply the optimization heuristics. If you specify -qno-opt-multiple-gather-scatter-by-shuffles or /Qopt-multiple-gather-scatter-by-shuffles-, the compiler will not apply the optimization.
- -qopt-zmm-usage=high
- FOPTIMIZE, OPTIMIZE
- Defines a level of zmm registers usage.
  -qopt-zmm-usage=keywoard Specifies the level of zmm register usage. You can specify one of the following:
  
  low - Tells the compiler that the compiled program is unlikely to benefit from zmm register usage. It specifies that the compiler should avoid using zmm register unless it can prove the gain from their usage.
  
  high - Tells the compiler to generate zmm code without restrictions
- -align array128byte
- FOPTIMIZE
- specify how data items are aligned
  keywords: all (same as -align), none (same as -noalign),
  [no]commons, [no]dcommons,
  [no]qcommons, [no]zcommons,
  rec1byte, rec2byte, rec4byte,
  rec8byte, rec16byte, rec32byte,
  array8byte, array16byte, array32byte,
  array64byte, array128byte, array256byte,
  [no]records, [no]sequence
- -ffinite-math-only
- FOPTIMIZE
- Allow optimizations for floating point arithmetic that assume arguments and results are not NaNs or Infinities.
- -fno-omit-frame-pointer
- FOPTIMIZE
- Determines whether EBP is used as a general-purpose register in optimizations.
- -m64
- FOPTIMIZE
- Tells the compiler to generate code for Intel® 64 architecture.
- -ipo1
- FOPTIMIZE
- Enables interprocedural optimization between files. Arguments: n Is an optional integer that specifies the number of object files the compiler should create. The integer must be greater than or equal to 0.
  -ipo[n]
  Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
  (n - number of multi-file objects)
- -foptimize-sibling-calls
- FOPTIMIZE
- Determines whether the compiler optimizes tail recursive calls. This feature is only available for ifort. This option determines whether the compiler optimizes tail recursive calls. It enables conversion of tail recursion into loops.
- -vec
- FOPTIMIZE
- Enables or disables vectorization. To disable vectorization, specify -no-vec (Linux* and macOS) or /Qvec- (Windows*). To disable interpretation of SIMD directives, specify -no-simd (Linux* and macOS) or /Qsimd- (Windows*). To disable all compiler vectorization, use the "-no-vec -no-simd" (Linux* and macOS) or "/Qvec- /Qsimd-" (Windows*) compiler options. The option -no-vec (and /Qvec-) disables all auto-vectorization, including vectorization of array notation statements. The option -no-simd (and /Qsimd-) disables vectorization of loops that have SIMD directives.

Shell, Environment, and Other Software Settings

Open MP Tuning Flags

Syntax
KMP_AFFINITY=[<modifier>,...]<type>[,<permute>][,<offset>]

Argument	Default	Description
modifier	noverbose respect granularity=core	Optional. String consisting of keyword and specifier. granularity=<specifier> takes the following specifiers: fine, thread, and core norespect noverbose nowarnings proclist={<proc-list>} respect verbose warnings
type	none	Required string. Indicates the thread affinity to use. compact disabled explicit none scatter logical (deprecated; instead use compact, but omit any permute value) physical (deprecated; instead use scatter, possibly with an offset value) The logical and physical types are deprecated but supported for backward compatibility.
permute	0	Optional. Positive integer value. Not valid with type values of explicit, none, or disabled.
offset	0	Optional. Positive integer value. Not valid with type values of explicit, none, or disabled.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2012-2023 Standard Performance Evaluation Corporation
Tested with SPEC OMP2012 v1.1.
Report generated on Wed Aug 16 14:58:20 2023 by SPEC OMP2012 flags formatter v538.

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

OMP2012 Flag DescriptionLenovo Global Technology ThinkSystem SR655V3 (AMD EPYC 9754, 2.25GHz)

Base Compiler Invocation

Peak Compiler Invocation

Base Portability Flags

Peak Portability Flags

Base Optimization Flags

Peak Optimization Flags

Syntax

Default

Description

Affinity Types

type = none (default)

type = compact

type = disabled

type = explicit

type = scatter

Deprecated Types: logical and physical

Permute and offset combinations

Modifier Values for Affinity Types

modifier = noverbose (default)

modifier = verbose

Execution modes

Serial

Turnaround

Throughput

OMP2012 Flag Description
Lenovo Global Technology ThinkSystem SR655V3 (AMD EPYC 9754, 2.25GHz)