hpc2021 Result Flag Description

Test sponsored by NVIDIA Corporation

Base Compiler Invocation

C benchmarks

- mpicc
- CC, LD
- The OpenMPI C driver configured for use with the NVIDIA HPC C compiler (nvc).

Fortran benchmarks

- mpif90
- FC, LD
- The OpenMPI Fortran driver configured for use with the NVIDIA HPC Fortran compiler (nvfortran).

Base Portability Flags

805.lbm_l

- -DSPEC_OPENACC_NO_SELF
- CPORTABILITY
- Use for compilers that do not support the OpenACC 'self' clause.

Base Optimization Flags

C benchmarks

- -fast
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Chooses generally optimal flags for the target platform.
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Mlre
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
- -DSPEC_ACCEL_AWARE_MPI
- OPTIMIZE
- Definition of this macro indicates that the MPI implementation supports accelerator device-to-device transfers. Used in conjuction when using OpenACC or OpenMP w/ target offload.
- -acc=gpu
- mpicc,mpicxx,mpif90
- OPTIMIZE
- Enable OpenACC directives targeting NVIDIA GPUs
- -gpu=cuda11.0
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Generate GPU device code using the CUDA 11.0 toolchain and libraries.
- -gpu=cc80
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Generate GPU device code targeting NVIDIA devices with compute capability 8.0.
- -Mstack_arrays
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Place automatic arrays on the stack.
- -Mfprelaxed
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- Includes:
- -Mnouniform
- mpicc, mpicxx,mpif90
- OPTIMIZE
- The numerical method used when computing the residual iterations of a vectorized (SIMD) loop may be different than used in the vectorized loop. Using this option may lead for fast but less numerically consistent results.
- -tp=zen2
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Target CPU code generation for AMD Zen 2 architecture (Ryzen 2)
- Includes:
  - -O3
    - -O2
      
      -O1

Fortran benchmarks

- -DSPEC_ACCEL_AWARE_MPI
- OPTIMIZE
- Definition of this macro indicates that the MPI implementation supports accelerator device-to-device transfers. Used in conjuction when using OpenACC or OpenMP w/ target offload.
- -fast
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Chooses generally optimal flags for the target platform.
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Mlre
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
- -acc=gpu
- mpicc,mpicxx,mpif90
- OPTIMIZE
- Enable OpenACC directives targeting NVIDIA GPUs
- -gpu=cuda11.0
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Generate GPU device code using the CUDA 11.0 toolchain and libraries.
- -gpu=cc80
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Generate GPU device code targeting NVIDIA devices with compute capability 8.0.
- -Mstack_arrays
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Place automatic arrays on the stack.
- -Mfprelaxed
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- Includes:
- -Mnouniform
- mpicc, mpicxx,mpif90
- OPTIMIZE
- The numerical method used when computing the residual iterations of a vectorized (SIMD) loop may be different than used in the vectorized loop. Using this option may lead for fast but less numerically consistent results.
- -tp=zen2
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Target CPU code generation for AMD Zen 2 architecture (Ryzen 2)
- Includes:
  - -O3
    - -O2
      
      -O1

Base Other Flags

C benchmarks (except as noted below)

- -Ispecmpitime
- BENCH_CFLAGS
- Specifies a directory to search for include files. Use -I to add directories to the search path for include files.
- -w
- mpicc, mpicxx, mpif90
- OPTIMIZE
- Disable warning messages.

834.hpgmgfv_l

- -Ispecmpitime
- BENCH_FLAGS
- Specifies a directory to search for include files. Use -I to add directories to the search path for include files.
- -w
- mpicc, mpicxx, mpif90
- OPTIMIZE
- Disable warning messages.

Fortran benchmarks (except as noted below)

- -w
- mpicc, mpicxx, mpif90
- OPTIMIZE
- Disable warning messages.

819.clvleaf_l

- -Ispecmpitime
- BENCH_FLAGS
- Specifies a directory to search for include files. Use -I to add directories to the search path for include files.
- -w
- mpicc, mpicxx, mpif90
- OPTIMIZE
- Disable warning messages.

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact info@spec.org
Copyright 2021-2022 Standard Performance Evaluation Corporation
Tested with SPEC hpc2021 v1.1.7.
Report generated on 2022-11-03 14:04:14 by SPEC hpc2021 flags formatter v1.0.3 .

hpc2021 Flag Description

Test sponsored by NVIDIA Corporation

Compilers: NVHPC SDK

Operating systems: Linux

Base Compiler Invocation

C benchmarks

Fortran benchmarks

Base Portability Flags

805.lbm_l

Base Optimization Flags

C benchmarks

Fortran benchmarks

Base Other Flags

C benchmarks (except as noted below)

834.hpgmgfv_l

Fortran benchmarks (except as noted below)

819.clvleaf_l

Implicitly Included Flags

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.