ACCEL Result Flag Description

Test sponsored by NVIDIA Corporation

Base Compiler Invocation

C benchmarks

- pgcc
- CC, LD
- The PGI C compiler for Linux.

Fortran benchmarks

- pgfortran
- FC, LD
- The PGI Fortran compiler for Linux.

Benchmarks using both Fortran and C

- pgcc
- CC
- The PGI C compiler for Linux.
- pgfortran
- FC, LD
- The PGI Fortran compiler for Linux.

Base Optimization Flags

C benchmarks

- -Mllvm
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l
- OPTIMIZE
- Use the llvm code generator.
- -V18.7
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l
- OPTIMIZE
- Set the compiler version to PGI 18.7.
- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -Mnouniform
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- The numerical method used when computing the residual iterations of a vectorized (SIMD) loop may be different than used in the vectorized loop. Using this option may lead for fast but less numerically consistent results.
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- COPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc70
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- COPTIMIZE
- Target NVIDA GPUs with compute capability 7.0

Fortran benchmarks

- -Mllvm
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l
- OPTIMIZE
- Use the llvm code generator.
- -V18.7
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l
- OPTIMIZE
- Set the compiler version to PGI 18.7.
- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -Mnouniform
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- The numerical method used when computing the residual iterations of a vectorized (SIMD) loop may be different than used in the vectorized loop. Using this option may lead for fast but less numerically consistent results.
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc70
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- FOPTIMIZE
- Target NVIDA GPUs with compute capability 7.0

Benchmarks using both Fortran and C

353.clvrleaf

- -Mllvm
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l
- OPTIMIZE
- Use the llvm code generator.
- -V18.7
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l
- OPTIMIZE
- Set the compiler version to PGI 18.7.
- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -Mnouniform
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- The numerical method used when computing the residual iterations of a vectorized (SIMD) loop may be different than used in the vectorized loop. Using this option may lead for fast but less numerically consistent results.
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc70
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Target NVIDA GPUs with compute capability 7.0

359.miniGhost

- -Mllvm
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l
- OPTIMIZE
- Use the llvm code generator.
- -V18.7
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l
- OPTIMIZE
- Set the compiler version to PGI 18.7.
- -fast
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Chooses generally optimal flags for the target platform. As of the PGI 7.0 release, the flags "-fast" and "-fastsse" are equivlent for 64-bit compilations. For 32-bit compilations "-fast" does not include "-Mscalarsse", "-Mcache_align", or "-Mvect=sse".
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Msmart
  - -Mlre
  - -Mnoframe
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
  - -Mdaz
  - -Mscalarsse
- -Mfprelaxed
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy. The default on an AMD system is "-Mfprelaxed=sqrt,rsqrt,order". The default on an Intel system is "-Mfprelaxed=rsqrt,sqrt,div,order"
- Includes:
- -Mnouniform
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- OPTIMIZE
- The numerical method used when computing the residual iterations of a vectorized (SIMD) loop may be different than used in the vectorized loop. Using this option may lead for fast but less numerically consistent results.
- -acc
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Enable OpenACC directives.
- -ta=tesla:cc70
- pgcc_l, pgcpp_l, pgcplusplus_l, pgf90_l, pgf95_l, pgfortran_l, pgcc_w, pgcpp_w, pgf90_w, pgf95_w, pgfortran_w
- COPTIMIZE, FOPTIMIZE
- Target NVIDA GPUs with compute capability 7.0
- -Mnomain
- pgf95_l, pgfortran_l, pgf90_w, pgf95_w, pgfortran_w
- EXTRA_LDFLAGS
- Don't include Fortran main program object module.

Peak Optimization Flags

C benchmarks

Fortran benchmarks

Benchmarks using both Fortran and C

359.miniGhost

- basepeak = yes

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

Shell, Environment, and Other Software Settings

Shell, Environment, and Other Software Settings - PGI LLVM Compiler - x86 and Power architectures.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact webmaster@spec.org
Copyright 2015-2018 Standard Performance Evaluation Corporation
Tested with SPEC ACCEL v1.2.
Report generated on Thu Aug 30 18:55:39 2018 by SPEC ACCEL flags formatter v1290.

ACCEL Flag Description
Supermicro SuperServer 1029GQ-TRT

Test sponsored by NVIDIA Corporation

Compilers: PGI Accelerator Fortran/C/C++ Server

Operating systems: Linux

Base Compiler Invocation

C benchmarks

Fortran benchmarks

Benchmarks using both Fortran and C

Base Optimization Flags

C benchmarks

Fortran benchmarks

Benchmarks using both Fortran and C

353.clvrleaf

359.miniGhost

Peak Optimization Flags

C benchmarks

303.ostencil

304.olbm

314.omriq

352.ep

354.cg

357.csp

370.bt

Fortran benchmarks

350.md

351.palm

355.seismic

356.sp

360.ilbdc

363.swim

Benchmarks using both Fortran and C

353.clvrleaf

359.miniGhost

Implicitly Included Flags

Shell, Environment, and Other Software Settings

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

ACCEL Flag DescriptionSupermicro SuperServer 1029GQ-TRT

Test sponsored by NVIDIA Corporation

Compilers: PGI Accelerator Fortran/C/C++ Server

Operating systems: Linux

Base Compiler Invocation

Base Optimization Flags

Peak Optimization Flags

Implicitly Included Flags

ACCEL Flag Description
Supermicro SuperServer 1029GQ-TRT