ACCEL Result Flag Description

Base Optimization Flags

C benchmarks

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.

Fortran benchmarks

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.

Benchmarks using both Fortran and C

353.clvrleaf

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.

359.miniGhost

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.
- -Mnomain
- pgf95_l, pgfortran_l, pgf95_w, pgfortran_w
- EXTRA_LDFLAGS
- Don't include Fortran main program object module.

Peak Optimization Flags

C benchmarks

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.

Fortran benchmarks

350.md

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.
- -ta=tesla:maxregcount:48
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Set maximum number of registers to use on the GPU.

351.palm

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc=noautopar
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Disable loop autoparallelization within a acc parallel region.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.
- -ta=tesla:fastmath
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Use the fast math library on the device.
- -lfftw3
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l
- LIBS
- Link using FFTW 3.3.3 library for Linux. Description from FFTW:
  
  FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions, of arbitrary input size, and of both real and complex data (as well as of even/odd data, i.e. the discrete cosine/sine transforms or DCT/DST).

355.seismic

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.

363.swim

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.
- -ta=tesla:pin
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Use pinned host data for memory transfers.

Benchmarks using both Fortran and C

353.clvrleaf

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.

359.miniGhost

- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc35
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Target NVIDA GPUs with compute capability 3.5
- -ta=tesla:cuda5.5
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Use NVIDIA CUDA 5.5 to compile device code.
- -ta=tesla:maxregcount:32
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Set maximum number of registers to use on the GPU.
- -Mnomain
- pgf95_l, pgfortran_l, pgf95_w, pgfortran_w
- EXTRA_LDFLAGS
- Don't include Fortran main program object module.

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2014-2015 Standard Performance Evaluation Corporation
Tested with SPEC ACCEL v39.
Report generated on Tue Mar 3 14:20:59 2015 by SPEC ACCEL flags formatter v1156.

ACCEL Flag Description
ASUS ASUS P9X79 Motherboard

Test sponsored by NVIDIA Corporation

Compilers: PGI Server Complete 2014

Operating systems: Linux

Base Compiler Invocation

C benchmarks

Fortran benchmarks

Benchmarks using both Fortran and C

Peak Compiler Invocation

C benchmarks

Fortran benchmarks

Benchmarks using both Fortran and C

Peak Portability Flags

351.palm

Base Optimization Flags

C benchmarks

Fortran benchmarks

Benchmarks using both Fortran and C

353.clvrleaf

359.miniGhost

Peak Optimization Flags

C benchmarks

Fortran benchmarks

350.md

351.palm

355.seismic

356.sp

360.ilbdc

363.swim

Benchmarks using both Fortran and C

353.clvrleaf

359.miniGhost

Peak Other Flags

Fortran benchmarks

351.palm

Implicitly Included Flags

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

ACCEL Flag DescriptionASUS ASUS P9X79 Motherboard

Test sponsored by NVIDIA Corporation

Compilers: PGI Server Complete 2014

Operating systems: Linux

Base Compiler Invocation

Peak Compiler Invocation

Peak Portability Flags

Base Optimization Flags

Peak Optimization Flags

Peak Other Flags

Implicitly Included Flags

ACCEL Flag Description
ASUS ASUS P9X79 Motherboard