CPU2006 Result Flag Description

Base Optimization Flags

C benchmarks

- -xCORE-AVX2
- COPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -ipo
- COPTIMIZE
- Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
- -O3
- COPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On IA-32 and Intel EM64T processors, when O3 is used with options -ax or -x (Linux) or with options /Qax or /Qx (Windows), the compiler performs more aggressive data dependency analysis than for O2, which may result in longer compilation times. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations. The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -funroll-loops
      
      -fno-builtin
      
      -mno-ieee-fp
      
      -fomit-framepointer
      
      -ffunction-sections
      
      -ftz
- -no-prec-div
- COPTIMIZE
- (disable/enable[default] -prec-div)
  -no-prec-div enables optimizations that give slightly less precise results than full IEEE division.
  
  When you specify -no-prec-div along with some optimizations, such as -xN and -xB (Linux) or /QxN and /QxB (Windows), the compiler may change floating-point division computations into multiplication by the reciprocal of the denominator. For example, A/B is computed as A * (1/B) to improve the speed of the computation.
  
  However, sometimes the value produced by this transformation is not as accurate as full IEEE division. When it is important to have fully precise IEEE division, do not use -no-prec-div. This will enable the default -prec-div and the result will be more accurate, with some loss of performance.
- -opt-prefetch
- COPTIMIZE
- Enable/disable(DEFAULT) the compiler to generate prefetch instructions to prefetch data.
- -opt-mem-layout-trans=3
- COPTIMIZE
- Controls the level of memory layout transformations performed by the compiler. This option can improve cache reuse and cache locality.
  - 0: Disables memory layout transformations. This is the same as specifying -no-opt-mem-layout-trans
  - 1: Enables basic memory layout transformations like structure splitting, structure peeling, field inlining, field reordering, array field transpose, increase field alignment etc.
  - 2: Enables more memory layout transformations like advanced structure splitting. This is the same as specifying -opt-mem-layout-trans
  - 3: Compiler is more aggressive in using memory layout transformations. You should only use this setting if your system has more than 4GB of physical memory per core.

C++ benchmarks

- -xCORE-AVX2
- CXXOPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -ipo
- CXXOPTIMIZE
- Multi-file ip optimizations that includes:
  - inline function expansion
  - interprocedural constant propogation
  - dead code elimination
  - propagation of function characteristics
  - passing arguments in registers
  - loop-invariant code motion
- -O3
- CXXOPTIMIZE
- Enables O2 optimizations plus more aggressive optimizations, such as prefetching, scalar replacement, and loop and memory access transformations. Enables optimizations for maximum speed, such as:
  - Loop unrolling, including instruction scheduling
  - Code replication to eliminate branches
  - Padding the size of certain power-of-two arrays to allow more efficient cache use.
  On IA-32 and Intel EM64T processors, when O3 is used with options -ax or -x (Linux) or with options /Qax or /Qx (Windows), the compiler performs more aggressive data dependency analysis than for O2, which may result in longer compilation times. The O3 optimizations may not cause higher performance unless loop and memory access transformations take place. The optimizations may slow down code in some cases compared to O2 optimizations. The O3 option is recommended for applications that have loops that heavily use floating-point calculations and process large data sets.
- Includes:
  - -O2
    - -O1
      
      -funroll-loops
      
      -fno-builtin
      
      -mno-ieee-fp
      
      -fomit-framepointer
      
      -ffunction-sections
      
      -ftz
- -no-prec-div
- CXXOPTIMIZE
- (disable/enable[default] -prec-div)
  -no-prec-div enables optimizations that give slightly less precise results than full IEEE division.
  
  When you specify -no-prec-div along with some optimizations, such as -xN and -xB (Linux) or /QxN and /QxB (Windows), the compiler may change floating-point division computations into multiplication by the reciprocal of the denominator. For example, A/B is computed as A * (1/B) to improve the speed of the computation.
  
  However, sometimes the value produced by this transformation is not as accurate as full IEEE division. When it is important to have fully precise IEEE division, do not use -no-prec-div. This will enable the default -prec-div and the result will be more accurate, with some loss of performance.
- -opt-prefetch
- CXXOPTIMIZE
- Enable/disable(DEFAULT) the compiler to generate prefetch instructions to prefetch data.
- -opt-mem-layout-trans=3
- CXXOPTIMIZE
- Controls the level of memory layout transformations performed by the compiler. This option can improve cache reuse and cache locality.
  - 0: Disables memory layout transformations. This is the same as specifying -no-opt-mem-layout-trans
  - 1: Enables basic memory layout transformations like structure splitting, structure peeling, field inlining, field reordering, array field transpose, increase field alignment etc.
  - 2: Enables more memory layout transformations like advanced structure splitting. This is the same as specifying -opt-mem-layout-trans
  - 3: Compiler is more aggressive in using memory layout transformations. You should only use this setting if your system has more than 4GB of physical memory per core.

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

Commands and Options Used to Submit Benchmark Runs

This result has been formatted using multiple flags files. The "submit command" from each of them appears next.

Submit command from Intel-ic14.0-official-linux64-revC

SPEC CPU2006 Flag Description for the Intel(R) C++ and Fortran Compiler 14.0 for IA32 and Intel 64 applications

Submit command from EXO-platform-flags-HSW-v1.1

SPEC CPU2006 Flag Description for EXO Platform

For multi-copy runs or single copy runs on systems with multiple sockets, it is advantageous to bind a process to a particular core. Otherwise, the OS may arbitrarily move your process from one core to another. This can effect performance. To help, SPEC allows the use of a "submit" command where users can specify a utility to use to bind processes. We have found the utility 'numactl' to be the best choice.

numactl runs processes with a specific NUMA scheduling or memory placement policy. The policy is set for a command and inherited by all of its children. The numactl flag "--physcpubind" specifies which core(s) to bind the process. "-l" instructs numactl to keep a process memory on the local node while "-m" specifies which node(s) to place a process memory. For full details on using numactl, please refer to your Linux documentation, 'man numactl'

Note that some versions of numactl, particularly the version found on SLES 10, we have found that the utility incorrectly interprets application arguments as it's own. For example, with the command "numactl --physcpubind=0 -l a.out -m a", numactl will interpret a.out's "-m" option as it's own "-m" option. To work around this problem, a user can put the command to be run in a shell script and then run the shell script using numactl. For example: "echo 'a.out -m a' > run.sh ; numactl --physcpubind=0 bash run.sh"

Shell, Environment, and Other Software Settings

This result has been formatted using multiple flags files. The "sw environment" from each of them appears next.

Sw environment from Intel-ic14.0-official-linux64-revC

SPEC CPU2006 Flag Description for the Intel(R) C++ and Fortran Compiler 14.0 for IA32 and Intel 64 applications

Red Hat Specific features

Sw environment from EXO-platform-flags-HSW-v1.1

SPEC CPU2006 Flag Description for EXO Platform

Sets the stack size to n kbytes, or unlimited to allow the stack size to grow without limit.

Firmware / BIOS / Microcode Settings

Disabling Intel's Hyper-Threading Technology in BIOS reduces the number of threads per core to 1. The default is Enabled; in this case each core provides additional resources for executing up to 2 threads in parallel.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2006-2016 Standard Performance Evaluation Corporation
Tested with SPEC CPU2006 v1.2.
Report generated on Thu Jun 30 13:26:47 2016 by SPEC CPU2006 flags formatter v6906.

CPU2006 Flag Description
EXO S.A. SERVER EXO SPRINT X385

Base Compiler Invocation

C benchmarks

C++ benchmarks

Base Portability Flags

400.perlbench

462.libquantum

483.xalancbmk

Base Optimization Flags

C benchmarks

C++ benchmarks

Base Other Flags

C benchmarks

403.gcc

Implicitly Included Flags

Commands and Options Used to Submit Benchmark Runs

Submit command from Intel-ic14.0-official-linux64-revC

SPEC CPU2006 Flag Description for the Intel(R) C++ and Fortran Compiler 14.0 for IA32 and Intel 64 applications

Submit command from EXO-platform-flags-HSW-v1.1

SPEC CPU2006 Flag Description for EXO Platform

Shell, Environment, and Other Software Settings

Sw environment from Intel-ic14.0-official-linux64-revC

SPEC CPU2006 Flag Description for the Intel(R) C++ and Fortran Compiler 14.0 for IA32 and Intel 64 applications

Red Hat Specific features

Sw environment from EXO-platform-flags-HSW-v1.1

SPEC CPU2006 Flag Description for EXO Platform

Firmware / BIOS / Microcode Settings

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

CPU2006 Flag DescriptionEXO S.A. SERVER EXO SPRINT X385

Base Compiler Invocation

Base Portability Flags

Base Optimization Flags

Base Other Flags

Implicitly Included Flags

Submit command from Intel-ic14.0-official-linux64-revC

SPEC CPU2006 Flag Description for the Intel(R) C++ and Fortran Compiler 14.0 for IA32 and Intel 64 applications

Submit command from EXO-platform-flags-HSW-v1.1

SPEC CPU2006 Flag Description for EXO Platform

Sw environment from Intel-ic14.0-official-linux64-revC

SPEC CPU2006 Flag Description for the Intel(R) C++ and Fortran Compiler 14.0 for IA32 and Intel 64 applications

Red Hat Specific features

Sw environment from EXO-platform-flags-HSW-v1.1

SPEC CPU2006 Flag Description for EXO Platform

CPU2006 Flag Description
EXO S.A. SERVER EXO SPRINT X385