CPU2006 Result Flag Description

Base Portability Flags

410.bwaves

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

416.gamess

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

433.milc

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

434.zeusmp

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

435.gromacs

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.
- -nofor_main
- LDPORTABILITY
- This option specifies that the main program is not written in Fortran. It is a link-time option that prevents the compiler from linking for_main.o into applications.
  
  For example, if the main program is written in C and calls a Fortran subprogram, specify -nofor-main when compiling the program with the ifort command. If you omit this option, the main program must be a Fortran program.

436.cactusADM

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.
- -nofor_main
- LDPORTABILITY
- This option specifies that the main program is not written in Fortran. It is a link-time option that prevents the compiler from linking for_main.o into applications.
  
  For example, if the main program is written in C and calls a Fortran subprogram, specify -nofor-main when compiling the program with the ifort command. If you omit this option, the main program must be a Fortran program.

437.leslie3d

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

444.namd

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

447.dealII

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

450.soplex

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

453.povray

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

454.calculix

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.
- -nofor_main
- LDPORTABILITY
- This option specifies that the main program is not written in Fortran. It is a link-time option that prevents the compiler from linking for_main.o into applications.
  
  For example, if the main program is written in C and calls a Fortran subprogram, specify -nofor-main when compiling the program with the ifort command. If you omit this option, the main program must be a Fortran program.

459.GemsFDTD

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

465.tonto

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

470.lbm

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

481.wrf

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.
- -DSPEC_CPU_CASE_FLAG
- CPORTABILITY
- This macro indicates that Fortran functions called from C should have their names lower-cased.
- -DSPEC_CPU_LINUX
- CPORTABILITY
- This macro indicates that the benchmark is being compiled on a Linux system.

482.sphinx3

- -DSPEC_CPU_LP64
- PORTABILITY
- This option is used to indicate that the host system's integers are 32-bits wide, and longs and pointers are 64-bits wide. Not all benchmarks recognize this macro, but the preferred practice for data model selection applies the flags to all benchmarks; this flag description is a placeholder for those benchmarks that do not recognize this macro.

Peak Optimization Flags

C benchmarks

433.milc

- -prof-gen
- PASS1_CFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_CFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -fno-alias
- OPTIMIZE
- This options tells the compiler to assume no aliasing in the program.
- -auto-ilp32
- COPTIMIZE
- This option instructs the compiler to analyze and transform the program so that 64-bit pointers are shrunk to 32-bit pointers, and 64-bit longs (on Linux) are shrunk into 32-bit longs wherever it is legal and safe to do so. In order for this option to be effective the compiler must be able to optimize using the -ipo/-Qipo option and must be able to analyze all library/external calls the program makes.
  
  This option requires that the size of the program executable never exceeds 2³² bytes and all data values can be represented within 32 bits. If the program can run correctly in a 32-bit system, these requirements are implicitly satisfied. If the program violates these size restrictions, unpredictable behavior might occur.

470.lbm

- -prof-gen
- PASS1_CFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_CFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -unroll2
- COPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -unroll2 would unroll a maximum of 2 times.
- -scalar-rep-
- COPTIMIZE
- -scalar-rep enables scalar replacement performed during loop transformation. To use this option, you must also specify O3. -scalar-rep- disables this optimization.
- -prefetch
- COPTIMIZE
- Enable/disable(DEFAULT) the compiler to generate prefetch instructions to prefetch data.
- -opt-malloc-options=3
- COPTIMIZE
- The compiler adds setup code in the C/C++/Fortran main function to enable optimal malloc algorithms:
  Function: int mallopt (int param, int value) When calling mallopt, the param argument specifies the parameter to be set, and value the new value to be set. Possible choices for param, as defined in malloc.h, are:

482.sphinx3

- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -unroll2
- COPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -unroll2 would unroll a maximum of 2 times.

C++ benchmarks

444.namd

- -prof-gen
- PASS1_CXXFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_CXXFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- CXXOPTIMIZE, OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -fno-alias
- CXXOPTIMIZE
- This options tells the compiler to assume no aliasing in the program.
- -auto-ilp32
- CXXOPTIMIZE
- This option instructs the compiler to analyze and transform the program so that 64-bit pointers are shrunk to 32-bit pointers, and 64-bit longs (on Linux) are shrunk into 32-bit longs wherever it is legal and safe to do so. In order for this option to be effective the compiler must be able to optimize using the -ipo/-Qipo option and must be able to analyze all library/external calls the program makes.
  
  This option requires that the size of the program executable never exceeds 2³² bytes and all data values can be represented within 32 bits. If the program can run correctly in a 32-bit system, these requirements are implicitly satisfied. If the program violates these size restrictions, unpredictable behavior might occur.

447.dealII

- -prof-gen
- PASS1_CXXFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_CXXFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- CXXOPTIMIZE, OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -unroll2
- CXXOPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -unroll2 would unroll a maximum of 2 times.
- -ansi-alias
- CXXOPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.
- -scalar-rep-
- CXXOPTIMIZE
- -scalar-rep enables scalar replacement performed during loop transformation. To use this option, you must also specify O3. -scalar-rep- disables this optimization.

450.soplex

- -prof-gen
- PASS1_CXXFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_CXXFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -opt-malloc-options=3
- OPTIMIZE
- The compiler adds setup code in the C/C++/Fortran main function to enable optimal malloc algorithms:
  Function: int mallopt (int param, int value) When calling mallopt, the param argument specifies the parameter to be set, and value the new value to be set. Possible choices for param, as defined in malloc.h, are:

453.povray

- -prof-gen
- PASS1_CXXFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_CXXFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- CXXOPTIMIZE, OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -unroll4
- CXXOPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -unroll2 would unroll a maximum of 2 times.
- -ansi-alias
- CXXOPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.

Fortran benchmarks

410.bwaves

- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -prefetch
- OPTIMIZE
- Enable/disable(DEFAULT) the compiler to generate prefetch instructions to prefetch data.
- -parallel
- OPTIMIZE
- Tells the auto-parallelizer to generate multithreaded code for loops that can be safely executed in parallel. To use this option, you must also specify option O2 or O3. The default numbers of threads spawned is equal to the number of processors detected in the system where the binary is compiled. Can be changed by setting the environment variable OMP_NUM_THREADS

416.gamess

- -prof-gen
- PASS1_FFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_FFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -unroll2
- OPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -unroll2 would unroll a maximum of 2 times.
- -Ob0
- OPTIMIZE
- Specifies the level of inline function expansion.
  
  Ob0 - Disables inlining of user-defined functions. Note that statement functions are always inlined.
  
  Ob1 - Enables inlining when an inline keyword or an inline attribute is specified. Also enables inlining according to the C++ language.
  
  Ob2 - Enables inlining of any function at the compiler's discretion.
- -ansi-alias
- OPTIMIZE
- Enable/disable(DEFAULT) use of ANSI aliasing rules in optimizations; user asserts that the program adheres to these rules.
- -scalar-rep-
- OPTIMIZE
- -scalar-rep enables scalar replacement performed during loop transformation. To use this option, you must also specify O3. -scalar-rep- disables this optimization.

434.zeusmp

- -prof-gen
- PASS1_FFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_FFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div

459.GemsFDTD

- -prof-gen
- PASS1_FFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_FFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -unroll2
- OPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -unroll2 would unroll a maximum of 2 times.
- -Ob0
- OPTIMIZE
- Specifies the level of inline function expansion.
  
  Ob0 - Disables inlining of user-defined functions. Note that statement functions are always inlined.
  
  Ob1 - Enables inlining when an inline keyword or an inline attribute is specified. Also enables inlining according to the C++ language.
  
  Ob2 - Enables inlining of any function at the compiler's discretion.
- -prefetch
- OPTIMIZE
- Enable/disable(DEFAULT) the compiler to generate prefetch instructions to prefetch data.
- -parallel
- OPTIMIZE
- Tells the auto-parallelizer to generate multithreaded code for loops that can be safely executed in parallel. To use this option, you must also specify option O2 or O3. The default numbers of threads spawned is equal to the number of processors detected in the system where the binary is compiled. Can be changed by setting the environment variable OMP_NUM_THREADS

465.tonto

- -prof-gen
- PASS1_FFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_FFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -unroll4
- OPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -unroll2 would unroll a maximum of 2 times.
- -auto
- OPTIMIZE
- Make all local variables AUTOMATIC. Same as -automatic

Benchmarks using both Fortran and C

435.gromacs

- -prof-gen
- PASS1_CFLAGS, PASS1_FFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_CFLAGS, PASS2_FFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -prefetch
- OPTIMIZE
- Enable/disable(DEFAULT) the compiler to generate prefetch instructions to prefetch data.
- -auto-ilp32
- COPTIMIZE
- This option instructs the compiler to analyze and transform the program so that 64-bit pointers are shrunk to 32-bit pointers, and 64-bit longs (on Linux) are shrunk into 32-bit longs wherever it is legal and safe to do so. In order for this option to be effective the compiler must be able to optimize using the -ipo/-Qipo option and must be able to analyze all library/external calls the program makes.
  
  This option requires that the size of the program executable never exceeds 2³² bytes and all data values can be represented within 32 bits. If the program can run correctly in a 32-bit system, these requirements are implicitly satisfied. If the program violates these size restrictions, unpredictable behavior might occur.

436.cactusADM

- -prof-gen
- PASS1_CFLAGS, PASS1_FFLAGS, PASS1_LDFLAGS
- Instrument program for profiling for the first phase of two-phase profile guided otimization. This instrumentation gathers information about a program's execution paths and data values but does not gather information from hardware performance counters. The profile instrumentation also gathers data for optimizations which are unique to profile-feedback optimization.
- -prof-use
- PASS2_CFLAGS, PASS2_FFLAGS, PASS2_LDFLAGS
- Instructs the compiler to produce a profile-optimized executable and merges available dynamic information (.dyn) files into a pgopti.dpi file. If you perform multiple executions of the instrumented program, -prof-use merges the dynamic information files again and overwrites the previous pgopti.dpi file.
  Without any other options, the current directory is searched for .dyn files
- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -unroll2
- OPTIMIZE
- Tells the compiler the maximum number of times to unroll loops. For example -unroll2 would unroll a maximum of 2 times.
- -prefetch
- OPTIMIZE
- Enable/disable(DEFAULT) the compiler to generate prefetch instructions to prefetch data.
- -parallel
- OPTIMIZE
- Tells the auto-parallelizer to generate multithreaded code for loops that can be safely executed in parallel. To use this option, you must also specify option O2 or O3. The default numbers of threads spawned is equal to the number of processors detected in the system where the binary is compiled. Can be changed by setting the environment variable OMP_NUM_THREADS
- -auto-ilp32
- COPTIMIZE
- This option instructs the compiler to analyze and transform the program so that 64-bit pointers are shrunk to 32-bit pointers, and 64-bit longs (on Linux) are shrunk into 32-bit longs wherever it is legal and safe to do so. In order for this option to be effective the compiler must be able to optimize using the -ipo/-Qipo option and must be able to analyze all library/external calls the program makes.
  
  This option requires that the size of the program executable never exceeds 2³² bytes and all data values can be represented within 32 bits. If the program can run correctly in a 32-bit system, these requirements are implicitly satisfied. If the program violates these size restrictions, unpredictable behavior might occur.

454.calculix

- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -unroll-aggressive
- OPTIMIZE
- Enables more aggressive unrolling heuristics
- -auto-ilp32
- COPTIMIZE
- This option instructs the compiler to analyze and transform the program so that 64-bit pointers are shrunk to 32-bit pointers, and 64-bit longs (on Linux) are shrunk into 32-bit longs wherever it is legal and safe to do so. In order for this option to be effective the compiler must be able to optimize using the -ipo/-Qipo option and must be able to analyze all library/external calls the program makes.
  
  This option requires that the size of the program executable never exceeds 2³² bytes and all data values can be represented within 32 bits. If the program can run correctly in a 32-bit system, these requirements are implicitly satisfied. If the program violates these size restrictions, unpredictable behavior might occur.

481.wrf

- -fast
- OPTIMIZE
- The -fast option enhances execution speed across the entire program by including the following options that can improve run-time performance:
  
  -O3 (maximum speed and high-level optimizations)
  
  -ipo (enables interprocedural optimizations across files)
  
  -xT (generate code specialized for Intel(R) Core(TM)2 Duo processors, Intel(R) Core(TM)2 Quad processors and Intel(R) Xeon(R) processors with SSSE3)
  
  -static Statically link in libraries at link time
  
  -no-prec-div (disable -prec-div) where -prec-div improves precision of FP divides (some speed impact)
  
  To override one of the options set by -fast, specify that option after the -fast option on the command line. The exception is the xT option which can't be overridden. The options set by -fast may change from release to release.
- Includes:
  - -O3
    - -O2
    - -fomit-frame-pointer
  - -ipo
  - -xT
  - -static
  - -no-prec-div
- -parallel
- OPTIMIZE
- Tells the auto-parallelizer to generate multithreaded code for loops that can be safely executed in parallel. To use this option, you must also specify option O2 or O3. The default numbers of threads spawned is equal to the number of processors detected in the system where the binary is compiled. Can be changed by setting the environment variable OMP_NUM_THREADS
- -prefetch
- OPTIMIZE
- Enable/disable(DEFAULT) the compiler to generate prefetch instructions to prefetch data.
- -auto-ilp32
- COPTIMIZE
- This option instructs the compiler to analyze and transform the program so that 64-bit pointers are shrunk to 32-bit pointers, and 64-bit longs (on Linux) are shrunk into 32-bit longs wherever it is legal and safe to do so. In order for this option to be effective the compiler must be able to optimize using the -ipo/-Qipo option and must be able to analyze all library/external calls the program makes.
  
  This option requires that the size of the program executable never exceeds 2³² bytes and all data values can be represented within 32 bits. If the program can run correctly in a 32-bit system, these requirements are implicitly satisfied. If the program violates these size restrictions, unpredictable behavior might occur.

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

System and Other Tuning Information

One or more of the following settings may have been set. If so, the corresponding notes sections of the report will say so; and you can read below to find out more about what these settings mean.

This Environment Variable sets the maximum number of threads to use for OpenMP* parallel regions if no other value is specified in the application. This environment variable applies to both -openmp and -parallel (Linux and Mac OS X) or /Qopenmp and /Qparallel (Windows). Example syntax on a Linux system with 8 cores:
export OMP_NUM_THREADS=8
Default is the number of cores visible to the OS.

KMP_AFFINITY = < physical | logical >, starting-core-id
This Environment Variable specifies the static mapping of user threads to physical cores, for example, if you have a system configured with 8 cores, OMP_NUM_THREADS=8 and KMP_AFFINITY=physical,2. Thread 0 will mapped to core 2, thread 1 will be mapped to core 3, and so on in a round-robin fashion.

This BIOS option allows the enabling/disabling of a processor mechanism to prefetch data into the cache according to a pattern-recognition algorithm.

In some cases, setting this option to Disabled may improve performance. Users should only disable this option after performing application benchmarking to verify improved performance in their environment.

This BIOS option allows the enabling/disabling of a processor mechanism to fetch the adjacent cache line within an 128-byte sector that contains the data needed due to a cache line miss.

This BIOS option enables/disables the Snoop Filter. The Snoop Filter is designed to reduce system bus utilization coming from cache misses. On the Intel 5000X and 5400 chipset, it is built as a cache structure able to minimize unnecessary snoop traffic.
When enabled, it can lead to significant memory performance improvements for several workstation applications on suitable memory configurations.

This Linux command (a bash builtin command) sets the stack size to n kbytes, or unlimited to allow the stack size to grow without limit.

submit= MYMASK=`printf '0x%x' \$((1<<\$SPECCOPYNUM))`; /usr/bin/taskset \$MYMASK $command

When running multiple copies of benchmarks, the SPEC config file feature submit is sometimes used to cause individual jobs to be bound to specific processors. This specific submit command is used for Linux. The description of the elements of the command are:

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2006-2014 Standard Performance Evaluation Corporation
Tested with SPEC CPU2006 v1.0.
Report generated on Tue Jul 22 19:09:24 2014 by SPEC CPU2006 flags formatter v6906.

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

CPU2006 Flag DescriptionFujitsu Siemens Computers PRIMERGY RX300 S4, Intel Xeon E5420, 2.50 GHz

Base Compiler Invocation

Peak Compiler Invocation

Base Portability Flags

Peak Portability Flags

Base Optimization Flags

Peak Optimization Flags

Implicitly Included Flags

CPU2006 Flag Description
Fujitsu Siemens Computers PRIMERGY RX300 S4, Intel Xeon E5420, 2.50 GHz