GIGA-BYTE TECHNOLOGY CO., LTD (Test Sponsor: NVIDIA Corporation) GIGA-BYTE G242-P31 (Ampere Altra Q80-33, Tesla A100-PCIE-40GB) |
SPEChpc 2021_tny_base = 19.8 |
SPEChpc 2021_tny_peak = 23.9 |
hpc2021 License: | 019 | Test Date: | Sep-2021 |
---|---|---|---|
Test Sponsor: | NVIDIA Corporation | Hardware Availability: | Jun-2021 |
Tested by: | NVIDIA Corporation | Software Availability: | Sep-2021 |
Benchmark result graphs are available in the PDF report.
Benchmark | Base | Peak | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Model | Ranks | Thrds/Rnk | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Model | Ranks | Thrds/Rnk | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
SPEChpc 2021_tny_base | 19.8 | |||||||||||||||||
SPEChpc 2021_tny_peak | 23.9 | |||||||||||||||||
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||||||
505.lbm_t | ACC | 2 | 1 | 54.7 | 41.1 | 54.7 | 41.2 | ACC | 2 | 1 | 54.7 | 41.1 | 54.7 | 41.2 | ||||
513.soma_t | ACC | 2 | 1 | 80.8 | 45.8 | 81.0 | 45.7 | ACC | 2 | 1 | 77.2 | 48.0 | 77.3 | 47.9 | ||||
518.tealeaf_t | ACC | 2 | 1 | 180 | 9.15 | 180 | 9.16 | ACC | 2 | 1 | 152 | 10.8 | 152 | 10.9 | ||||
519.clvleaf_t | ACC | 2 | 1 | 66.5 | 24.8 | 67.7 | 24.4 | ACC | 2 | 1 | 62.7 | 26.3 | 62.4 | 26.4 | ||||
521.miniswp_t | ACC | 2 | 1 | 109 | 14.6 | 110 | 14.6 | ACC | 2 | 1 | 88.5 | 18.1 | 90.1 | 17.8 | ||||
528.pot3d_t | ACC | 2 | 1 | 99.5 | 21.4 | 99.6 | 21.3 | ACC | 2 | 1 | 99.3 | 21.4 | 99.8 | 21.3 | ||||
532.sph_exa_t | ACC | 2 | 1 | 289 | 6.74 | 289 | 6.76 | ACC | 16 | 1 | 95.8 | 20.3 | 96.4 | 20.2 | ||||
534.hpgmgfv_t | ACC | 2 | 1 | 124 | 9.44 | 122 | 9.62 | ACC | 2 | 1 | 110 | 10.7 | 112 | 10.5 | ||||
535.weather_t | ACC | 2 | 1 | 58.1 | 55.5 | 56.3 | 57.3 | ACC | 2 | 1 | 56.1 | 57.4 | 56.2 | 57.3 |
Hardware Summary | |
---|---|
Type of System: | SMP |
Compute Node: | Ampere Altra |
Interconnect: | None |
Compute Nodes Used: | 1 |
Total Chips: | 1 |
Total Cores: | 80 |
Total Threads: | 80 |
Total Memory: | 256 GB |
Max. Peak Threads: | 1 |
Software Summary | |
---|---|
Compiler: | C/C++/Fortran: Version 21.9 of NVIDIA HPC SDK for Linux |
MPI Library: | OpenMPI Version 4.0.5, included with NVHPC SDK |
Other MPI Info: | None |
Other Software: | None |
Base Parallel Model: | ACC |
Base Ranks Run: | 2 |
Base Threads Run: | 1 |
Peak Parallel Models: | ACC |
Minimum Peak Ranks: | 2 |
Maximum Peak Ranks: | 16 |
Max. Peak Threads: | 1 |
Min. Peak Threads: | 1 |
Hardware | |
---|---|
Number of nodes: | 1 |
Uses of the node: | compute |
Vendor: | GIGA-BYTE TECHNOLOGY CO., LTD |
Model: | G242-P31 |
CPU Name: | Ampere Altra Q80-33 |
CPU(s) orderable: | 1 chips |
Chips enabled: | 1 |
Cores enabled: | 80 |
Cores per chip: | 80 |
Threads per core: | 1 |
CPU Characteristics: | Max Frequency 3300Mhz |
CPU MHz: | 3000 |
Primary Cache: | 64 KB I + 64 KB D on chip per core |
Secondary Cache: | 1 MB I+D on chip per core |
L3 Cache: | 32 MB I+D on chip per core |
Other Cache: | None |
Memory: | 256 GB (16 x 16 GB 2Rx8 PC4-3200AA-R) |
Disk Subsystem: | 1 x 960 GB, NVME, M.2, PCIe Gen3 |
Other Hardware: | None |
Accel Count: | 2 |
Accel Model: | Tesla A100-PCIE-40GB |
Accel Vendor: | NVIDIA Corporation |
Accel Type: | GPU |
Accel Connection: | PCIe 3.0 16x |
Accel ECC enabled: | Yes |
Accel Description: | See Notes |
Adapter: | None |
Number of Adapters: | 0 |
Slot Type: | None |
Data Rate: | None |
Ports Used: | 0 |
Interconnect Type: | None |
Software | |
---|---|
Accelerator Driver: | NVIDIA UNIX aarch64 Kernel Module 460.32.03 |
Adapter: | None |
Adapter Driver: | None |
Adapter Firmware: | None |
Operating System: | CentOS 8.3-2011 |
Local File System: | xfs |
Shared File System: | None |
System State: | Multi-user, run level 3 |
Other Software: | None |
Hardware | |
---|---|
Vendor: | N/A |
Model: | N/A |
Switch Model: | N/A |
Number of Switches: | 0 |
Number of Ports: | 0 |
Data Rate: | 0 |
Firmware: | 0 |
Topology: | N/A |
Primary Use: | N/A |
Software |
---|
The config file option 'submit' was used. MPI startup command: mpirun command was used to start MPI jobs.
Information from nvaccelinfo CUDA Driver Version: 11020 NVRM version: NVIDIA UNIX aarch64 Kernel Module 460.32.03 Device Number: 0 Device Name: A100-PCIE-40GB Device Revision Number: 8.0 Global Memory Size: 42505273344 Number of Multiprocessors: 108 Concurrent Copy and Execution: Yes Total Constant Memory: 65536 Total Shared Memory per Block: 49152 Registers per Block: 65536 Warp Size: 32 Maximum Threads per Block: 1024 Maximum Block Dimensions: 1024, 1024, 64 Maximum Grid Dimensions: 2147483647 x 65535 x 65535 Maximum Memory Pitch: 2147483647B Texture Alignment: 512B Clock Rate: 1410 MHz Execution Timeout: No Integrated Device: No Can Map Host Memory: Yes Compute Mode: default Concurrent Kernels: Yes ECC Enabled: Yes Memory Clock Rate: 1215 MHz Memory Bus Width: 5120 bits L2 Cache Size: 41943040 bytes Max Threads Per SMP: 2048 Async Engines: 3 Unified Addressing: Yes Managed Memory: Yes Concurrent Managed Memory: Yes Preemption Supported: Yes Cooperative Launch: Yes Multi-Device: Yes Default Target: cc80
============================================================================== CC 505.lbm_t(base, peak) 513.soma_t(base, peak) 518.tealeaf_t(base, peak) 521.miniswp_t(base, peak) 534.hpgmgfv_t(base, peak) ------------------------------------------------------------------------------ nvc 21.9-0 linuxarm64 target on aarch64 Linux NVIDIA Compilers and Tools Copyright (c) 2021, NVIDIA CORPORATION & AFFILIATES. All rights reserved. ------------------------------------------------------------------------------ ============================================================================== CXXC 532.sph_exa_t(base, peak) ------------------------------------------------------------------------------ nvc++ 21.9-0 linuxarm64 target on aarch64 Linux NVIDIA Compilers and Tools Copyright (c) 2021, NVIDIA CORPORATION & AFFILIATES. All rights reserved. ------------------------------------------------------------------------------ ============================================================================== FC 519.clvleaf_t(base, peak) 528.pot3d_t(base, peak) 535.weather_t(base, peak) ------------------------------------------------------------------------------ nvfortran 21.9-0 linuxarm64 target on aarch64 Linux NVIDIA Compilers and Tools Copyright (c) 2021, NVIDIA CORPORATION & AFFILIATES. All rights reserved. ------------------------------------------------------------------------------
521.miniswp_t: | -DUSE_KBA -DUSE_ACCELDIR |
532.sph_exa_t: | -DSPEC_USE_LT_IN_KERNELS --c++17 |
-Mfprelaxed -Mnouniform -Mstack_arrays -fast -acc=gpu |
-Mfprelaxed -Mnouniform -Mstack_arrays -fast -acc=gpu |
-Mfprelaxed -Mnouniform -Mstack_arrays -fast -acc=gpu |
521.miniswp_t: | -DUSE_KBA -DUSE_ACCELDIR |
532.sph_exa_t: | -DSPEC_USE_LT_IN_KERNELS |
505.lbm_t: | basepeak = yes |
513.soma_t: | -fast -O3 -acc=gpu -gpu=pinned |
518.tealeaf_t: | -fast -Msafeptr -acc=gpu |
521.miniswp_t: | -Mfprelaxed -Mnouniform -Mstack_arrays -fast -acc=gpu -gpu=pinned |
534.hpgmgfv_t: | -fast -acc=gpu -gpu=pinned -static-nvidia |
-Mfprelaxed -Mnouniform -Mstack_arrays -fast -acc=gpu |
519.clvleaf_t: | -Mfprelaxed -fast -acc=gpu -gpu=pinned |
528.pot3d_t: | -Mstack_arrays -fast -acc=gpu |
535.weather_t: | -Mfprelaxed -Mnouniform -Mstack_arrays -fast -acc=gpu |