MPI2007 license: | 6569 | Test date: | Sep-2024 |
---|---|---|---|
Test sponsor: | Supermicro | Hardware Availability: | Oct-2024 |
Tested by: | Supermicro | Software Availability: | Apr-2024 |

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

Benchmark | Base Ranks | Base Seconds | Base Ratio | Base Seconds | Base Ratio | Base Seconds | Base Ratio | Peak Ranks | Peak Seconds | Peak Ratio | Peak Seconds | Peak Ratio | Peak Seconds | Peak Ratio |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
104.milc | 256 | 18.5 | 84.6 | 18.4 | 85.1 | 18.3 | 85.6 | 256 | 18.5 | 84.6 | 18.4 | 85.1 | 18.3 | 85.6 |
107.leslie3d | 256 | 61.4 | 85.0 | 61.5 | 84.9 | 61.9 | 84.4 | 256 | 61.4 | 85.0 | 61.5 | 84.9 | 61.9 | 84.4 |
113.GemsFDTD | 256 | 104 | 60.7 | 104 | 60.6 | 104 | 60.5 | 256 | 104 | 60.7 | 104 | 60.6 | 104 | 60.5 |
115.fds4 | 256 | 13.9 | 141 | 13.9 | 140 | 13.9 | 140 | 256 | 13.9 | 141 | 13.9 | 140 | 13.9 | 140 |
121.pop2 | 256 | 61.5 | 67.2 | 61.6 | 67.0 | 61.3 | 67.4 | 256 | 61.5 | 67.2 | 61.6 | 67.0 | 61.3 | 67.4 |
122.tachyon | 256 | 30.2 | 92.6 | 29.4 | 95.0 | 29.4 | 95.1 | 256 | 30.2 | 92.6 | 29.4 | 95.0 | 29.4 | 95.1 |
126.lammps | 256 | 64.2 | 45.4 | 64.2 | 45.4 | 64.1 | 45.5 | 256 | 64.2 | 45.4 | 64.2 | 45.4 | 64.1 | 45.5 |
127.wrf2 | 256 | 42.1 | 185 | 42.5 | 183 | 42.1 | 185 | 256 | 42.1 | 185 | 42.5 | 183 | 42.1 | 185 |
128.GAPgeofem | 256 | 14.1 | 147 | 14.0 | 147 | 14.0 | 147 | 256 | 14.1 | 147 | 14.0 | 147 | 14.0 | 147 |
129.tera_tf | 256 | 24.4 | 114 | 24.4 | 114 | 24.4 | 114 | 256 | 24.4 | 114 | 24.4 | 114 | 24.4 | 114 |
130.socorro | 256 | 37.6 | 102 | 37.6 | 101 | 38.5 | 99.1 | 256 | 37.6 | 102 | 37.6 | 101 | 38.5 | 99.1 |
132.zeusmp2 | 256 | 33.0 | 93.9 | 33.0 | 94.0 | 33.0 | 94.0 | 256 | 33.0 | 93.9 | 33.0 | 94.0 | 33.0 | 94.0 |
137.lu | 256 | 30.4 | 121 | 30.4 | 121 | 30.4 | 121 | 256 | 30.4 | 121 | 30.4 | 121 | 30.4 | 121 |
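
As a cross-check on the results table, the following Python sketch takes the three base ratios reported for each benchmark, picks the median, and combines the medians with a geometric mean, which is how SPEC composite metrics are generally formed. The names `BASE_RATIOS`, `median_ratios`, and `geometric_mean` are illustrative and not part of any SPEC tool. The table shows rounded ratios and this section does not quote the overall score, so the printed value should be read as an approximation rather than the published metric.

```python
from math import exp, log
from statistics import median

# Base ratios copied from the results table (three runs per benchmark).
BASE_RATIOS = {
    "104.milc": (84.6, 85.1, 85.6),
    "107.leslie3d": (85.0, 84.9, 84.4),
    "113.GemsFDTD": (60.7, 60.6, 60.5),
    "115.fds4": (141, 140, 140),
    "121.pop2": (67.2, 67.0, 67.4),
    "122.tachyon": (92.6, 95.0, 95.1),
    "126.lammps": (45.4, 45.4, 45.5),
    "127.wrf2": (185, 183, 185),
    "128.GAPgeofem": (147, 147, 147),
    "129.tera_tf": (114, 114, 114),
    "130.socorro": (102, 101, 99.1),
    "132.zeusmp2": (93.9, 94.0, 94.0),
    "137.lu": (121, 121, 121),
}


def median_ratios(ratios):
    """Return the median of the reported ratios for each benchmark."""
    return {name: median(runs) for name, runs in ratios.items()}


def geometric_mean(values):
    """Geometric mean computed through logarithms."""
    values = list(values)
    return exp(sum(log(v) for v in values) / len(values))


medians = median_ratios(BASE_RATIOS)
overall = geometric_mean(medians.values())
print(f"benchmarks: {len(medians)}, geometric mean of medians: {overall:.1f}")
```

Because every benchmark in this result uses identical base and peak figures, the same calculation applies unchanged to the peak columns.
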
Hardware Summary | |
---|---|
Type of System: | Homogeneous |
Compute Node: | Hyper A+ Server AS -2126HS-TN |
Total Compute Nodes: | 1 |
Total Chips: | 2 |
Total Cores: | 256 |
Total Threads: | 256 |
Total Memory: | 1536 GB |
Base Ranks Run: | 256 |
Minimum Peak Ranks: | 256 |
Maximum Peak Ranks: | 256 |
Software Summary | |
---|---|
C Compiler: | Intel oneAPI DPC++/C++ Compiler 2024.2.1 |
C++ Compiler: | Intel oneAPI DPC++/C++ Compiler 2024.2.1 |
Fortran Compiler: | Intel oneAPI DPC++/C++ Compiler 2024.2.1 |
Base Pointers: | 64-bit |
Peak Pointers: | 64-bit |
MPI Library: | Intel MPI Version 2021.13 |
Other MPI Info: | None |
Pre-processors: | No |
Other Software: | Jemalloc-5.3.0 |
Hardware | |
---|---|
Number of nodes: | 1 |
Uses of the node: | compute |
Vendor: | Supermicro |
Model: | Hyper A+ Server AS -2126HS-TN |
CPU Name: | AMD EPYC 9755 |
CPU(s) orderable: | 1,2 chips |
Chips enabled: | 2 |
Cores enabled: | 256 |
Cores per chip: | 128 |
Threads per core: | 1 |
CPU Characteristics: | Max. Boost Clock up to 4.1 GHz |
CPU MHz: | 2700 |
Primary Cache: | 32 KB I + 48 KB D on chip per core |
Secondary Cache: | 1 MB I+D on chip per core |
L3 Cache: | 512 MB I+D on chip per chip, 32 MB shared / 8 cores |
Other Cache: | None |
Memory: | 1536 GB (24 x 64 GB 2Rx4 PC5-6400B-R, running at 6000) |
Disk Subsystem: | 1 x 3.5 TB NVMe SSD |
Other Hardware: | None |
Adapter: | None |
Number of Adapters: | 1 |
Slot Type: | None |
Data Rate: | None |
Ports Used: | 0 |
Interconnect Type: | None |
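
The hardware figures are internally consistent, and a short Python sketch makes the arithmetic explicit: totals follow from per-chip and per-DIMM values, and the L3 description implies sixteen 32 MB slices per chip. All variable names are illustrative; the numbers are copied from the tables, and nothing here queries real hardware.

```python
# Values copied from the hardware tables.
chips = 2
cores_per_chip = 128
threads_per_core = 1
dimm_count = 24
dimm_size_gb = 64
l3_per_chip_mb = 512
l3_slice_mb = 32          # "32 MB shared / 8 cores"
cores_per_l3_slice = 8

# Derived totals.
total_cores = chips * cores_per_chip
total_threads = total_cores * threads_per_core
total_memory_gb = dimm_count * dimm_size_gb
dimms_per_chip = dimm_count // chips
l3_slices_per_chip = l3_per_chip_mb // l3_slice_mb

# Compare against the values stated in the report.
assert total_cores == 256
assert total_threads == 256
assert total_memory_gb == 1536
# Each L3 slice serves 8 cores, so the slices must account for every core.
assert l3_slices_per_chip * cores_per_l3_slice == cores_per_chip

print(f"cores={total_cores} threads={total_threads} memory={total_memory_gb} GB")
print(f"DIMMs per chip={dimms_per_chip}, L3 slices per chip={l3_slices_per_chip}")
```

With 256 cores and 256 base ranks, the run places one MPI rank per core.
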
Software | |
---|---|
Adapter: | None |
Adapter Driver: | None |
Adapter Firmware: | None |
Operating System: | Ubuntu 24.04 LTS 6.8.0-44-generic |
Local File System: | ext4 |
Shared File System: | None |
System State: | Multi-user, run level 3 |
Other Software: | None |

The config file option 'submit' was used:

mpiexec.hydra -bootstrap ssh -hosts localhost -genv I_MPI_COMPATIBILITY=3 -np $ranks -ppn $ranks $command

MPI startup command: mpiexec.hydra command was used to start MPI jobs.

RAM configuration: Compute nodes have 1 x 64 GB RDIMM on each memory channel.

BIOS settings: | |
---|---|
SMT: | Disabled |
NUMA nodes per socket: | NPS4 |
ACPI SRAT L3 Cache as NUMA Domain: | Enabled |
Determinism Control: | Manual |
Determinism Enable: | Power |
xGMI Link Configuration: | 4 xGMI Links |
4 Link xGMI max speed: | 32 Gbps |
TDP Control: | Manual |
TDP: | 500 |
Package Power Limit Control: | Manual |
Package Power Limit: | 500 |

NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown) is mitigated in the system as tested and documented.

Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1) is mitigated in the system as tested and documented.

Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2) is mitigated in the system as tested and documented.

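
The submit line is a template in which `$ranks` and `$command` are filled in for each run. The Python sketch below only imitates that substitution with `string.Template` to show what the launched command looks like for 256 ranks; the SPEC tools perform the real expansion themselves. The executable name `./benchmark_exe` and the helper `expand_submit` are hypothetical.

```python
import shlex
from string import Template

# Submit template copied from the notes; $ranks and $command are placeholders.
SUBMIT = Template(
    "mpiexec.hydra -bootstrap ssh -hosts localhost "
    "-genv I_MPI_COMPATIBILITY=3 -np $ranks -ppn $ranks $command"
)


def expand_submit(ranks, command):
    """Fill the placeholders the way a launcher wrapper might."""
    return SUBMIT.substitute(ranks=ranks, command=command)


# "./benchmark_exe" is a made-up executable name used for illustration only.
line = expand_submit(256, "./benchmark_exe")
argv = shlex.split(line)
print(line)
```

Since `-np` and `-ppn` both receive `$ranks` and the only host is `localhost`, all ranks are started on the single compute node.
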
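Base Compiler Invocation | |
---|---|
C benchmarks: | mpiicc -cc=icx |
126.lammps: | mpiicpc -cxx=icpx |
Fortran benchmarks: | mpiifort -fc=ifx |
Benchmarks using both Fortran and C: | mpiicc -cc=icx mpiifort -fc=ifx |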
Base Portability Flags | |
---|---|
104.milc: | -DSPEC_MPI_LP64 |
115.fds4: | -DSPEC_MPI_LP64 |
121.pop2: | -DSPEC_MPI_CASE_FLAG -DSPEC_MPI_LP64 |
122.tachyon: | -DSPEC_MPI_LP64 |
126.lammps: | -DMPICH_IGNORE_CXX_SEEK |
127.wrf2: | -DSPEC_MPI_CASE_FLAG -DSPEC_MPI_LINUX -DSPEC_MPI_LP64 |
128.GAPgeofem: | -DSPEC_MPI_LP64 |
130.socorro: | -DSPEC_MPI_LP64 |
132.zeusmp2: | -DSPEC_MPI_LP64 |
Base Optimization Flags | |
---|---|
C benchmarks: | -Ofast -ipo -march=skylake-avx512 -mtune=skylake-avx512 -ansi-alias |
126.lammps: | -Ofast -ipo -march=skylake-avx512 -mtune=skylake-avx512 -ansi-alias |
Fortran benchmarks: | -Ofast -ipo -march=skylake-avx512 -mtune=skylake-avx512 -nostandard-realloc-lhs -align array64byte |
Benchmarks using both Fortran and C: | -Ofast -ipo -march=skylake-avx512 -mtune=skylake-avx512 -ansi-alias -nostandard-realloc-lhs -align array64byte |
Base Other Flags | |
---|---|
104.milc: | -Wno-implicit-function-declaration -Wno-implicit-int -limf -Wl,--rpath=/usr/local/lib -ljemalloc |
122.tachyon: | -limf -Wl,--rpath=/usr/local/lib -ljemalloc |
126.lammps: | -Wno-register -limf -Wl,--rpath=/usr/local/lib -ljemalloc |
-limf -Wl,--rpath=/usr/local/lib -ljemalloc |
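
To see how the flag groups combine for a single benchmark, the Python sketch below collects the entries for 104.milc and 126.lammps and joins them into one string. It assumes that the `mpiicc -cc=icx` invocation and the `-ansi-alias` optimization row apply to the C benchmark 104.milc. The ordering of the groups is also an assumption: the SPEC tools decide the actual command lines for compiling and linking, so this is a summary of which flags are in play, not a reproduction of the build. `FLAGS` and `flag_summary` are illustrative names.

```python
# Flag groups copied from the report for two benchmarks.
FLAGS = {
    "104.milc": {
        "compiler": "mpiicc -cc=icx",  # assumed to be the C benchmark row
        "portability": "-DSPEC_MPI_LP64",
        "optimization": "-Ofast -ipo -march=skylake-avx512 "
                        "-mtune=skylake-avx512 -ansi-alias",
        "other": "-Wno-implicit-function-declaration -Wno-implicit-int "
                 "-limf -Wl,--rpath=/usr/local/lib -ljemalloc",
    },
    "126.lammps": {
        "compiler": "mpiicpc -cxx=icpx",
        "portability": "-DMPICH_IGNORE_CXX_SEEK",
        "optimization": "-Ofast -ipo -march=skylake-avx512 "
                        "-mtune=skylake-avx512 -ansi-alias",
        "other": "-Wno-register -limf -Wl,--rpath=/usr/local/lib -ljemalloc",
    },
}

# Order in which the groups are joined (an assumption for readability).
GROUP_ORDER = ("compiler", "portability", "optimization", "other")


def flag_summary(benchmark):
    """Join the flag groups of one benchmark into a single string."""
    groups = FLAGS[benchmark]
    return " ".join(groups[key] for key in GROUP_ORDER)


for name in FLAGS:
    print(f"{name}: {flag_summary(name)}")
```

Both summaries end with `-ljemalloc`, matching the Jemalloc-5.3.0 entry listed under Other Software.
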
Peak Optimization Flags | |
---|---|
104.milc: | basepeak = yes |
122.tachyon: | basepeak = yes |
126.lammps: | basepeak = yes |
107.leslie3d: | basepeak = yes |
113.GemsFDTD: | basepeak = yes |
129.tera_tf: | basepeak = yes |
137.lu: | basepeak = yes |
115.fds4: | basepeak = yes |
121.pop2: | basepeak = yes |
127.wrf2: | basepeak = yes |
128.GAPgeofem: | basepeak = yes |
130.socorro: | basepeak = yes |
132.zeusmp2: | basepeak = yes |
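
In SPEC configurations, `basepeak = yes` generally means the base build and its results are reused for peak, which would explain why the base and peak columns of the results table are identical. The Python sketch below checks that property on three rows copied from the table; `split_row` is an illustrative helper, and the remaining rows can be added to `ROWS` in the same form.

```python
# Three rows copied verbatim from the results table.
ROWS = """\
104.milc | 256 | 18.5 | 84.6 | 18.4 | 85.1 | 18.3 | 85.6 | 256 | 18.5 | 84.6 | 18.4 | 85.1 | 18.3 | 85.6 |
122.tachyon | 256 | 30.2 | 92.6 | 29.4 | 95.0 | 29.4 | 95.1 | 256 | 30.2 | 92.6 | 29.4 | 95.0 | 29.4 | 95.1 |
130.socorro | 256 | 37.6 | 102 | 37.6 | 101 | 38.5 | 99.1 | 256 | 37.6 | 102 | 37.6 | 101 | 38.5 | 99.1 |
"""


def split_row(line):
    """Split a table row into (benchmark, base cells, peak cells)."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    name, values = cells[0], cells[1:]
    half = len(values) // 2
    return name, values[:half], values[half:]


results = [split_row(line) for line in ROWS.splitlines() if line.strip()]

for name, base, peak in results:
    status = "identical" if base == peak else "DIFFERENT"
    print(f"{name}: base and peak columns are {status}")
```
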