ACCEL Result Flag Description

Peak Portability Flags

503.postencil

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.

504.polbm

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enables the use of nested SIMD statements for OpenMP.

514.pomriq

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enables the use of nested SIMD statements for OpenMP.

550.pmd

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.
- -80
- FPORTABILITY
- FPORTABILITY flag

551.ppalm

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.
- -DSPEC_HOST_FFTW3
- FPORTABILITY, PORTABILITY
- By default, 551.ppalm uses the Temperton Algorithm to compute FFTs. By defining SPEC_HOST_FFTW3, the benchmark will instead use a user suppiled FFTW3 library. The arrays passed to this library will be the host copy.
  
  Users must specify both -DSPEC_HOST_FFTW as well as the include path to the FFTW3 interface file, fftw3.f03. They must also add the FFTW3 libary to the libraries. For example:
  
  315.palm:
  PORTABILITY+= -I/path/to/include -DSPEC_HOST_FFTW
  EXTRA_LIBS += -I/path/to/lib -lfftw3_host_lib

552.pep

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enables the use of nested SIMD statements for OpenMP.

553.pclvrleaf

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.

554.pcg

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enables the use of nested SIMD statements for OpenMP.

555.pseismic

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.

556.psp

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.

557.pcsp

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.

559.pmniGhost

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.
- -nofor-main
- FPORTABILITY
- No Fortran main method exists, use C equivalent instead.

560.pilbdc

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enables the use of nested SIMD statements for OpenMP.

563.pswim

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.

570.pbt

- -DSPEC_USE_INNER_SIMD
- PORTABILITY
- Enable use of SIMD directive inside of loop rather than on outer loop.

Base Optimization Flags

C benchmarks

- -O3
- OPTIMIZE
- optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs
- -xCORE-AVX2
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  May generate Intel(R) Advanced Vector Extensions 2 (Intel(R) AVX2), Intel(R) AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2, and SSE instructions for Intel(R) processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives (same as -fopenmp)
- -qopenmp-offload=host
- OPTIMIZE
- Enables OpenMP* offloading compilation for target pragmas. This option only applies to Intel(R) MIC Architecture and Intel(R) Graphics Technology. Enabled by default with -qopenmp. Use -qno-openmp-offload to disable.
  Specify kind to specify the default device for target pragmas
  host - allow target code to run on host system while still doing the outlining for offload
  mic - specify Intel(R) MIC Architecture
  gfx - specify Intel(R) Graphics Technology
- -fimf-precision=low:sqrt,exp,log,/
- OPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied

Fortran benchmarks

- -O3
- OPTIMIZE
- optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs
- -xCORE-AVX2
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  May generate Intel(R) Advanced Vector Extensions 2 (Intel(R) AVX2), Intel(R) AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2, and SSE instructions for Intel(R) processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives (same as -fopenmp)
- -qopenmp-offload=host
- OPTIMIZE
- Enables OpenMP* offloading compilation for target pragmas. This option only applies to Intel(R) MIC Architecture and Intel(R) Graphics Technology. Enabled by default with -qopenmp. Use -qno-openmp-offload to disable.
  Specify kind to specify the default device for target pragmas
  host - allow target code to run on host system while still doing the outlining for offload
  mic - specify Intel(R) MIC Architecture
  gfx - specify Intel(R) Graphics Technology
- -fimf-precision=low:sqrt,exp,log,/
- OPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied

Benchmarks using both Fortran and C

- -O3
- OPTIMIZE
- optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs
- -xCORE-AVX2
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  May generate Intel(R) Advanced Vector Extensions 2 (Intel(R) AVX2), Intel(R) AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2, and SSE instructions for Intel(R) processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives (same as -fopenmp)
- -qopenmp-offload=host
- OPTIMIZE
- Enables OpenMP* offloading compilation for target pragmas. This option only applies to Intel(R) MIC Architecture and Intel(R) Graphics Technology. Enabled by default with -qopenmp. Use -qno-openmp-offload to disable.
  Specify kind to specify the default device for target pragmas
  host - allow target code to run on host system while still doing the outlining for offload
  mic - specify Intel(R) MIC Architecture
  gfx - specify Intel(R) Graphics Technology
- -fimf-precision=low:sqrt,exp,log,/
- OPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied

Peak Optimization Flags

C benchmarks

504.polbm

- -O3
- OPTIMIZE
- optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs
- -xCORE-AVX2
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  May generate Intel(R) Advanced Vector Extensions 2 (Intel(R) AVX2), Intel(R) AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2, and SSE instructions for Intel(R) processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives (same as -fopenmp)
- -qopenmp-offload=host
- OPTIMIZE
- Enables OpenMP* offloading compilation for target pragmas. This option only applies to Intel(R) MIC Architecture and Intel(R) Graphics Technology. Enabled by default with -qopenmp. Use -qno-openmp-offload to disable.
  Specify kind to specify the default device for target pragmas
  host - allow target code to run on host system while still doing the outlining for offload
  mic - specify Intel(R) MIC Architecture
  gfx - specify Intel(R) Graphics Technology
- -fimf-precision=low:sqrt,exp,log,/
- OPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied

552.pep

- -O3
- OPTIMIZE
- optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs
- -xCORE-AVX2
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  May generate Intel(R) Advanced Vector Extensions 2 (Intel(R) AVX2), Intel(R) AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2, and SSE instructions for Intel(R) processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives (same as -fopenmp)
- -qopenmp-offload=host
- OPTIMIZE
- Enables OpenMP* offloading compilation for target pragmas. This option only applies to Intel(R) MIC Architecture and Intel(R) Graphics Technology. Enabled by default with -qopenmp. Use -qno-openmp-offload to disable.
  Specify kind to specify the default device for target pragmas
  host - allow target code to run on host system while still doing the outlining for offload
  mic - specify Intel(R) MIC Architecture
  gfx - specify Intel(R) Graphics Technology
- -fimf-precision=low:sqrt,exp,log,/
- OPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied
- -qopt-streaming-stores always
- OPTIMIZE
- Specifies whether streaming stores are generated:
  
  always - enables generation of streaming stores under the assumption that the application is memory bound
  
  auto - compiler decides when streaming stores are used (DEFAULT)
  
  never - disables generation of streaming stores

554.pcg

- -O3
- OPTIMIZE
- optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs
- -xCORE-AVX2
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  May generate Intel(R) Advanced Vector Extensions 2 (Intel(R) AVX2), Intel(R) AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2, and SSE instructions for Intel(R) processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives (same as -fopenmp)
- -qopenmp-offload=host
- OPTIMIZE
- Enables OpenMP* offloading compilation for target pragmas. This option only applies to Intel(R) MIC Architecture and Intel(R) Graphics Technology. Enabled by default with -qopenmp. Use -qno-openmp-offload to disable.
  Specify kind to specify the default device for target pragmas
  host - allow target code to run on host system while still doing the outlining for offload
  mic - specify Intel(R) MIC Architecture
  gfx - specify Intel(R) Graphics Technology
- -fimf-precision=low:sqrt,exp,log,/
- OPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied
- -qopt-prefetch=5
- OPTIMIZE
- Enable levels of prefetch insertion, where 0 disables. n may be 0 through 5 inclusive. Default is 2.

Fortran benchmarks

550.pmd

- -O3
- OPTIMIZE
- optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs
- -xCORE-AVX2
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  May generate Intel(R) Advanced Vector Extensions 2 (Intel(R) AVX2), Intel(R) AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2, and SSE instructions for Intel(R) processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives (same as -fopenmp)
- -qopenmp-offload=host
- OPTIMIZE
- Enables OpenMP* offloading compilation for target pragmas. This option only applies to Intel(R) MIC Architecture and Intel(R) Graphics Technology. Enabled by default with -qopenmp. Use -qno-openmp-offload to disable.
  Specify kind to specify the default device for target pragmas
  host - allow target code to run on host system while still doing the outlining for offload
  mic - specify Intel(R) MIC Architecture
  gfx - specify Intel(R) Graphics Technology
- -fimf-precision=low:sqrt,exp,log,/
- OPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied
- -qopt-prefetch=2
- OPTIMIZE
- Enable levels of prefetch insertion, where 0 disables. n may be 0 through 5 inclusive. Default is 2.

551.ppalm

- -O3
- OPTIMIZE
- optimize for maximum speed and enable more aggressive optimizations that may not improve performance on some programs
- -xCORE-AVX2
- OPTIMIZE
- Code is optimized for Intel(R) processors with support for AVX2 instructions. The resulting code may contain unconditional use of features that are not supported on other processors. This option also enables new optimizations in addition to Intel processor-specific optimizations including advanced data layout and code restructuring optimizations to improve memory accesses for Intel processors.
  
  May generate Intel(R) Advanced Vector Extensions 2 (Intel(R) AVX2), Intel(R) AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2, and SSE instructions for Intel(R) processors.
  
  Do not use this option if you are executing a program on a processor that is not an Intel processor. If you use this option on a non-compatible processor to compile the main program (in Fortran) or the function main() in C/C++, the program will display a fatal run-time error if they are executed on unsupported processors.
- -qopenmp
- OPTIMIZE
- Enable the compiler to generate multi-threaded code based on the OpenMP* directives (same as -fopenmp)
- -qopenmp-offload=host
- OPTIMIZE
- Enables OpenMP* offloading compilation for target pragmas. This option only applies to Intel(R) MIC Architecture and Intel(R) Graphics Technology. Enabled by default with -qopenmp. Use -qno-openmp-offload to disable.
  Specify kind to specify the default device for target pragmas
  host - allow target code to run on host system while still doing the outlining for offload
  mic - specify Intel(R) MIC Architecture
  gfx - specify Intel(R) Graphics Technology
- -fimf-precision=low:sqrt,exp,log,/
- OPTIMIZE
- -fimf-precision=value[:funclist]
  defines the accuracy (precision) for math library functions
  value - defined as one of the following values
  high - equivalent to max-error = 0.6
  medium - equivalent to max-error = 4 (DEFAULT)
  low - equivalent to accuracy-bits = 11 (single precision); accuracy-bits = 26 (double precision)
  funclist - optional comma separated list of one or more math library functions to which the attribute should be applied
- -I/home/abobyr/FFTW-3.3.6/include
- OPTIMIZE
- Adds the directory for include files to the search path at compile time.
- -L/home/abobyr/FFTW-3.3.6/lib
- LIBS
- Adds the library directory search path at link time

Benchmarks using both Fortran and C

559.pmniGhost

- basepeak = yes

Shell, Environment, and Other Software Settings

One or more of the following settings may have been applied to the testbed. If so, the "Platform Notes" section of the report will say so; and you can read below to find out more about what these settings mean.

LD_LIBRARY_PATH=<directories> (linker)
LD_LIBRARY_PATH controls the search order for both the compile-time and run-time linkers. Usually, it can be defaulted; but testers may sometimes choose to explicitly set it (as documented in the notes in the submission), in order to ensure that the correct versions of libraries are picked up.

STACKSIZE=<n> (Unix)
Set the size of the stack (temporary storage area) for each slave thread of a multithreaded program.

ulimit -s <n> (Unix)
Sets the stack size to n kbytes, or "unlimited" to allow the stack size to grow without limit.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2015-2017 Standard Performance Evaluation Corporation
Tested with SPEC ACCEL v75.
Report generated on Wed Jun 21 17:15:12 2017 by SPEC ACCEL flags formatter v1290.

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

ACCEL Flag DescriptionIntel Endeavour Node(Intel Xeon E5-2697 v4, 2.3GHz, DDR4-2400 MHz, SMT ON, Turbo OFF)

Base Compiler Invocation

Peak Compiler Invocation

Base Portability Flags

Peak Portability Flags

Base Optimization Flags

Peak Optimization Flags

Peak Other Flags

ACCEL Flag Description
Intel Endeavour Node(Intel Xeon E5-2697 v4, 2.3GHz, DDR4-2400 MHz, SMT ON, Turbo OFF)