CPU2006 Result Flag Description

Operating systems: SUSE Linux Enterprise 10, SUSE Linux Enterprise 11, and Red Hat Enterprise Linux Advanced Platform 5 and 6

Base Optimization Flags

C benchmarks

- -O3
- COPTIMIZE
- Optimize yet more. -O3 turns on all optimizations specified by -O2 and also turns on the -finline-functions, -funswitch-loops and -fgcse-after-reload options.
- Includes:
- -mcpu=power7
- COPTIMIZE
- Tune to cpu-type everything applicable about the generated code, except for the ABI and the set of available instructions.
  
  A sample list of supported values for this flag are
  - power
  - power2
  - power3
  - power4
  - power5
  - power5+
  - power6
  - power7
- -mtune=power7
- COPTIMIZE
- Sets the instruction scheduling parameters for a particular machine type, but does not set the architecture type, register usage, or choice of mnemonics, as -mcpu=cpu_type would. The same values for cpu_type are used for -mtune as for -mcpu. If both are specified, the code generated will use the architecture, registers, and mnemonics set by -mcpu, but the scheduling parameters set by -mtune.
- -m32
- COPTIMIZE
- Generate code for a 32-bit environment. The 32-bit environment sets int, long and pointer to 32 bits and generates code that runs on 32 bits system.
- -fpeel-loops
- COPTIMIZE
- Peels the loops for that there is enough information that they do not roll much (from profile feedback). It also turns on complete loop peeling (i.e. complete removal of loops with small constant number of iterations). Enabled with -fprofile-use
- -funroll-loops
- COPTIMIZE
- Unroll loops whose number of iterations can be determined at compile time or upon entry to the loop. -funroll-loops implies both -fstrength-reduce and -frerun-cse-after-loop. This option makes code larger, and may or may not make it run faster.
- Includes:
  - -fstrength-reduce
  - -frerun-cse-after-loop
- -ffast-math
- COPTIMIZE
- Sets the following flags:
  - -fno-math-errno
  - -funsafe-math-optimizations
  - -fno-trapping-math
  - -ffinite-math-only
  - -fno-signaling-nans
- Includes:
- -ftree-vectorize
- COPTIMIZE
- Perform loop vectorization on trees.
- -mvsx
- COPTIMIZE
- Generate code that uses vector/scalar (VSX) instructions, and also enable the use of built-in functions that allow more direct access to the VSX instruction set.
- -maltivec
- COPTIMIZE
- Generate code that uses AltiVec instructions, and also enable the use of built-in functions that allow more direct access to the AltiVec instruction set.
- -mpopcntd
- COPTIMIZE
- Allows GCC to generate the popcount instruction implemented on the POWER7 processor and other processors that support the PowerPC V2.06 architecture.
- -mrecip=rsqrt
- COPTIMIZE
- This option will enable GCC to use RCPSS and RSQRTSS instructions (and their vectorized variants RCPPS and RSQRTPS) with an additional Newton-Raphson step to increase precision instead of DIVSS and SQRTSS (and their vectorized variants) for single precision floating point arguments. These instructions are generated only when -funsafe-math-optimizations is enabled together with -finite-math-only and -fno-trapping-math.
- -flto
- COPTIMIZE
- This option runs the standard link-time optimizer. When invoked with source code, it generates GIMPLE (one of GCC's internal representations) and writes it to special ELF sections in the object file. When the object files are linked together, all the function bodies are read from these ELF sections and instantiated as if they had been part of the same translation unit.
- -fwhole-program
- COPTIMIZE
- Assume that the current compilation unit represents whole program being compiled. All public functions and variables with the exception of "main" and those merged by attribute "externally_visible" become static functions and in a affect gets more aggressively optimized by interprocedural optimizers.
- -fuse-linker-plugin
- COPTIMIZE
- Enables the use of linker plugin during link time optimization. This option relies on the linker plugin support in linker that is available in gold or in GNU ld 2.21 or newer. This option enables the extraction of object files with GIMPLE bytecode out of library archives. This improves the quality of optimization by exposing more code the the link time optimizer. This information specify what symbols can be accessed externally (by non-LTO object or during dynamic linking). Resulting code quality improvements on binaries (and shared libraries that do use hidden visibility) is similar to -fwhole-program. See -flto for a description on the effect of this flag and how to use it. Enabled by default when LTO support in GCC is enabled and GCC was compiled with a linker supporting plugins (GNU ld 2.21 or newer or gold).
- -lhugetlbfs
- EXTRA_CLIBS
- Link with libhugetlbfs.so. This enables heap to be backed by the 16 Megabyte pages.

C++ benchmarks

- -O3
- CXXOPTIMIZE
- Optimize yet more. -O3 turns on all optimizations specified by -O2 and also turns on the -finline-functions, -funswitch-loops and -fgcse-after-reload options.
- Includes:
- -mcpu=power7
- CXXOPTIMIZE
- Tune to cpu-type everything applicable about the generated code, except for the ABI and the set of available instructions.
  
  A sample list of supported values for this flag are
  - power
  - power2
  - power3
  - power4
  - power5
  - power5+
  - power6
  - power7
- -mtune=power7
- CXXOPTIMIZE
- Sets the instruction scheduling parameters for a particular machine type, but does not set the architecture type, register usage, or choice of mnemonics, as -mcpu=cpu_type would. The same values for cpu_type are used for -mtune as for -mcpu. If both are specified, the code generated will use the architecture, registers, and mnemonics set by -mcpu, but the scheduling parameters set by -mtune.
- -m32
- CXXOPTIMIZE
- Generate code for a 32-bit environment. The 32-bit environment sets int, long and pointer to 32 bits and generates code that runs on 32 bits system.
- -fpeel-loops
- CXXOPTIMIZE
- Peels the loops for that there is enough information that they do not roll much (from profile feedback). It also turns on complete loop peeling (i.e. complete removal of loops with small constant number of iterations). Enabled with -fprofile-use
- -funroll-loops
- CXXOPTIMIZE
- Unroll loops whose number of iterations can be determined at compile time or upon entry to the loop. -funroll-loops implies both -fstrength-reduce and -frerun-cse-after-loop. This option makes code larger, and may or may not make it run faster.
- Includes:
  - -fstrength-reduce
  - -frerun-cse-after-loop
- -ffast-math
- CXXOPTIMIZE
- Sets the following flags:
  - -fno-math-errno
  - -funsafe-math-optimizations
  - -fno-trapping-math
  - -ffinite-math-only
  - -fno-signaling-nans
- Includes:
- -ftree-vectorize
- CXXOPTIMIZE
- Perform loop vectorization on trees.
- -mvsx
- CXXOPTIMIZE
- Generate code that uses vector/scalar (VSX) instructions, and also enable the use of built-in functions that allow more direct access to the VSX instruction set.
- -maltivec
- CXXOPTIMIZE
- Generate code that uses AltiVec instructions, and also enable the use of built-in functions that allow more direct access to the AltiVec instruction set.
- -mpopcntd
- CXXOPTIMIZE
- Allows GCC to generate the popcount instruction implemented on the POWER7 processor and other processors that support the PowerPC V2.06 architecture.
- -mrecip=rsqrt
- CXXOPTIMIZE
- This option will enable GCC to use RCPSS and RSQRTSS instructions (and their vectorized variants RCPPS and RSQRTPS) with an additional Newton-Raphson step to increase precision instead of DIVSS and SQRTSS (and their vectorized variants) for single precision floating point arguments. These instructions are generated only when -funsafe-math-optimizations is enabled together with -finite-math-only and -fno-trapping-math.
- -flto
- CXXOPTIMIZE
- This option runs the standard link-time optimizer. When invoked with source code, it generates GIMPLE (one of GCC's internal representations) and writes it to special ELF sections in the object file. When the object files are linked together, all the function bodies are read from these ELF sections and instantiated as if they had been part of the same translation unit.
- -fwhole-program
- CXXOPTIMIZE
- Assume that the current compilation unit represents whole program being compiled. All public functions and variables with the exception of "main" and those merged by attribute "externally_visible" become static functions and in a affect gets more aggressively optimized by interprocedural optimizers.
- -fuse-linker-plugin
- CXXOPTIMIZE
- Enables the use of linker plugin during link time optimization. This option relies on the linker plugin support in linker that is available in gold or in GNU ld 2.21 or newer. This option enables the extraction of object files with GIMPLE bytecode out of library archives. This improves the quality of optimization by exposing more code the the link time optimizer. This information specify what symbols can be accessed externally (by non-LTO object or during dynamic linking). Resulting code quality improvements on binaries (and shared libraries that do use hidden visibility) is similar to -fwhole-program. See -flto for a description on the effect of this flag and how to use it. Enabled by default when LTO support in GCC is enabled and GCC was compiled with a linker supporting plugins (GNU ld 2.21 or newer or gold).
- -ltcmalloc
- EXTRA_CXXLIBS
- Link with tcmalloc's library for Linux on POWER. This is a library that optimizes calls to new, delete, malloc and free.

Peak Optimization Flags

C benchmarks

C++ benchmarks

483.xalancbmk

- basepeak = yes

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

Virtualization Settings

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2006-2014 Standard Performance Evaluation Corporation
Tested with SPEC CPU2006 v1.2.
Report generated on Thu Jul 24 13:38:03 2014 by SPEC CPU2006 flags formatter v6906.

CPU2006 Flag Description
IBM Corporation IBM Power 780 (3.7 GHz, 128 core, RHEL, GCC)

Base Compiler Invocation

C benchmarks

C++ benchmarks

Base Portability Flags

400.perlbench

462.libquantum

464.h264ref

483.xalancbmk

Base Optimization Flags

C benchmarks

C++ benchmarks

Peak Optimization Flags

C benchmarks

400.perlbench

401.bzip2

403.gcc

429.mcf

445.gobmk

456.hmmer

458.sjeng

462.libquantum

464.h264ref

C++ benchmarks

471.omnetpp

473.astar

483.xalancbmk

Implicitly Included Flags

Virtualization Settings

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

CPU2006 Flag DescriptionIBM Corporation IBM Power 780 (3.7 GHz, 128 core, RHEL, GCC)

Base Compiler Invocation

Base Portability Flags

Base Optimization Flags

Peak Optimization Flags

Implicitly Included Flags

CPU2006 Flag Description
IBM Corporation IBM Power 780 (3.7 GHz, 128 core, RHEL, GCC)