MPI2007 license: | 0018 | Test date: | May-2007 |
---|---|---|---|
Test sponsor: | QLogic Corporation | Hardware Availability: | Jul-2006 |
Tested by: | QLogic Performance Engineering | Software Availability: | Feb-2007 |
Benchmark | Base | Peak | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Ranks | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Ranks | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
104.milc | 256 | 77.2 | 20.3 | 77.2 | 20.3 | 75.0 | 20.9 | |||||||
107.leslie3d | 256 | 312 | 16.8 | 312 | 16.7 | 310 | 16.9 | |||||||
113.GemsFDTD | 256 | 438 | 14.4 | 457 | 13.8 | 657 | 9.60 | |||||||
115.fds4 | 256 | 64.9 | 30.0 | 61.8 | 31.6 | 62.4 | 31.3 | |||||||
121.pop2 | 256 | 269 | 15.4 | 270 | 15.3 | 269 | 15.4 | |||||||
122.tachyon | 256 | 138 | 20.3 | 140 | 20.0 | 136 | 20.6 | |||||||
126.lammps | 256 | 221 | 13.2 | 220 | 13.3 | 219 | 13.3 | |||||||
127.wrf2 | 256 | 216 | 36.1 | 215 | 36.2 | 215 | 36.3 | |||||||
128.GAPgeofem | 256 | 63.8 | 32.4 | 64.3 | 32.1 | 62.0 | 33.3 | |||||||
129.tera_tf | 256 | 168 | 16.5 | 168 | 16.5 | 181 | 15.3 | |||||||
130.socorro | 256 | 129 | 29.7 | 127 | 29.9 | 127 | 30.0 | |||||||
132.zeusmp2 | 256 | 136 | 22.9 | 131 | 23.6 | 131 | 23.7 | |||||||
137.lu | 256 | 92.7 | 39.7 | 92.3 | 39.8 | 92.0 | 40.0 |
Software Summary | |
---|---|
C Compiler: | QLogic PathScale C Compiler 3.0 |
C++ Compiler: | QLogic PathScale C++ Compiler 3.0 |
Fortran Compiler: | QLogic PathScale Fortran Compiler 3.0 |
Base Pointers: | 64-bit |
Peak Pointers: | 64-bit |
MPI Library: | QLogic InfiniPath MPI 2.0 |
Other MPI Info: | None |
Pre-processors: | No |
Other Software: | None |
Hardware | |
---|---|
Number of nodes: | 64 |
Uses of the node: | compute, head |
Vendor: | Dell |
Model: | Dell PowerEdge 1950 |
CPU Name: | Intel Xeon 5160 |
CPU(s) orderable: | 1-2 chips |
Chips enabled: | 2 |
Cores enabled: | 4 |
Cores per chip: | 2 |
Threads per core: | 1 |
CPU Characteristics: | 1333 MHz system bus |
CPU MHz: | 3000 |
Primary Cache: | 32 KB I + 32 KB D on chip per core |
Secondary Cache: | 4 MB I+D on chip per chip |
L3 Cache: | None |
Other Cache: | None |
Memory: | 8 GB (8 x 1 GB PC2-5300F) |
Disk Subsystem: | SAS, 73 GB, 15000 RPM |
Other Hardware: | None |
Adapter: | QLogic InfiniPath QLE7140 |
Number of Adapters: | 1 |
Slot Type: | PCIe x8 |
Data Rate: | InfiniBand 4x SDR |
Ports Used: | 1 |
Interconnect Type: | InfiniBand |
Software | |
---|---|
Adapter: | QLogic InfiniPath QLE7140 |
Adapter Driver: | InfiniPath 2.0 |
Adapter Firmware: | None |
Operating System: | ClusterVisionOS 2.1 Based on Scientific Linux SL release 4.3 (Beryllium) |
Local File System: | Linux/ext3 |
Shared File System: | NFS |
System State: | Multi-User |
Other Software: | Torque 2.1.2 |
Hardware | |
---|---|
Number of nodes: | 1 |
Uses of the node: | file server |
Vendor: | Dell |
Model: | Dell PowerEdge 1950 |
CPU Name: | Intel Xeon 5160 |
CPU(s) orderable: | 1-2 chip |
Chips enabled: | 2 |
Cores enabled: | 4 |
Cores per chip: | 2 |
Threads per core: | 1 |
CPU Characteristics: | 1333 MHz system bus |
CPU MHz: | 3000 |
Primary Cache: | 32 KB I + 32 KB D on chip per core |
Secondary Cache: | 4 MB I+D on chip per chip |
L3 Cache: | None |
Other Cache: | None |
Memory: | 4 GB (4 x 1 GB PC2-5300F) |
Disk Subsystem: | 13.5 TB: 3 x 15 x 300 GB, SAS, 10000 RPM 3 Dell PowerVault MD1000 Disk Arrays, each one has 15 disks. |
Other Hardware: | None |
Adapter: | Chelsio T310 10GBASE-SR RNIC (rev 3) |
Number of Adapters: | 1 |
Slot Type: | PCIe x8 MSI-X |
Data Rate: | 10 Gbps Ethernet |
Ports Used: | 1 |
Interconnect Type: | Ethernet |
Software | |
---|---|
Adapter: | Chelsio T310 10GBASE-SR RNIC (rev 3) |
Adapter Driver: | cxgb3 1.0.078 |
Adapter Firmware: | T 3.3.0 |
Operating System: | ClusterVisionOS 2.1 Based on Scientific Linux SL release 4.3 (Beryllium) |
Local File System: | Linux/ext3 |
Shared File System: | NFS |
System State: | Multi-User |
Other Software: | None |
A separate node handling login and resouces management is not listed as it is not performance related.
Hardware | |
---|---|
Vendor: | QLogic |
Model: | InfiniPath adapters and Silverstorm switches |
Switch Model: | QLogic SilverStorm 9080 Fabric Director (InfiniBand switch) |
Number of Switches: | 1 |
Number of Ports: | 96 |
Data Rate: | InfiniBand 4x SDR and InfiniBand 4x DDR |
Firmware: | 3.4.0.1.3 |
Topology: | Full Bisectional Bandwidth, Fat-Tree, Max 3 swith-chip hops. |
Primary Use: | MPI traffic |
The 64 nodes used are from one CU (Computational Unit, 65 nodes) of the 9 CUs in the Darwin cluster. Jobs within one CU use one SilverStorm 9080 switch. The data rate between InifniPath HCAs and SilverStorm switches is SDR. However, DDR is used for inter-switch links.
Hardware | |
---|---|
Vendor: | Chelsio, Nortel |
Model: | Chelsio T310 adapters and Nortel 5530 5510 8610 switches |
Switch Model: | Nortel Ethernet Routing Switch 5510-24T |
Number of Switches: | 1 |
Number of Ports: | 24 |
Data Rate: | 1 Gbps Ethernet |
Firmware: | 1.0.0.16 |
Switch Model: | Nortel Ethernet Routing Switch 5510-48T |
Number of Switches: | 3 |
Number of Ports: | 48 |
Data Rate: | 1 Gbps Ethernet |
Firmware: | 1.0.0.16 |
Switch Model: | Nortel Ethernet Routing Switch 5530-24TFD |
Number of Switches: | 2 |
Number of Ports: | 26 |
Data Rate: | 1 Gbps Ethernet (24 ports) and 10 Gbps Ethernet (2 ports) |
Firmware: | 4.2.0.12 |
Switch Model: | Nortel Passport 8610 switch 4.1.0.0 |
Number of Switches: | 1 |
Number of Ports: | 24 |
Data Rate: | 10 Gbps Ethernet |
Firmware: | Optivity Switch Manager version 4.1 |
Topology: | Three CUs are connected with six Ethernet Routing switches 5530-24TFD, 5510-24T and 5510-48T as a ring. Each of two 5530-24TFD switches is connected to the Nortel Passport 8610 switch through two 10Gbit ports. See Slide 10 of NortelEthernetSwitchDiagram.pdf for a network diagram. |
Primary Use: | file system traffic |
/usr/bin/mpicc -cc=pathcc |
126.lammps: | /usr/bin/mpicxx -CC=pathCC |
107.leslie3d: | /usr/bin/mpif90 -f90=pathf90 |
113.GemsFDTD: | /usr/bin/mpif90 -f90=pathf90 |
115.fds4: | /usr/bin/mpif90 -f90=pathf90 |
129.tera_tf: | /usr/bin/mpif90 -f90=pathf90 |
132.zeusmp2: | /usr/bin/mpif90 -f90=pathf90 |
137.lu: | /usr/bin/mpif90 -f90=pathf90 |
/usr/bin/mpicc -cc=pathcc /usr/bin/mpif90 -f90=pathf90 |
104.milc: | -DSPEC_MPI_LP64 |
121.pop2: | -DSPEC_MPI_DOUBLE_UNDERSCORE -DSPEC_MPI_LP64 |
122.tachyon: | -DSPEC_MPI_LP64 |
127.wrf2: | -DF2CSTYLE -DSPEC_MPI_DOUBLE_UNDERSCORE -DSPEC_MPI_LINUX -DSPEC_MPI_LP64 |
128.GAPgeofem: | -DSPEC_MPI_LP64 |
130.socorro: | -fno-second-underscore -DSPEC_MPI_LP64 |
-march=core -Ofast |
126.lammps: | -march=core -O3 -OPT:Ofast -CG:local_fwd_sched=on |
107.leslie3d: | -march=core -O3 -OPT:Ofast -OPT:malloc_alg=1 -LANG:copyinout=off |
113.GemsFDTD: | -march=core -O3 -OPT:Ofast -OPT:malloc_alg=1 -LANG:copyinout=off |
115.fds4: | -march=core -O3 -OPT:Ofast -OPT:malloc_alg=1 -LANG:copyinout=off |
129.tera_tf: | -march=core -O3 -OPT:Ofast -OPT:malloc_alg=1 -LANG:copyinout=off |
132.zeusmp2: | -march=core -O3 -OPT:Ofast -OPT:malloc_alg=1 -LANG:copyinout=off |
137.lu: | -march=core -O3 -OPT:Ofast -OPT:malloc_alg=1 -LANG:copyinout=off |
121.pop2: | -march=core -Ofast -O3 -OPT:Ofast -OPT:malloc_alg=1 -LANG:copyinout=off |
127.wrf2: | Same as 121.pop2 |
128.GAPgeofem: | Same as 121.pop2 |
130.socorro: | Same as 121.pop2 |
-IPA:max_jobs=4 |
126.lammps: | -IPA:max_jobs=4 |
107.leslie3d: | -IPA:max_jobs=4 |
113.GemsFDTD: | -IPA:max_jobs=4 |
115.fds4: | -IPA:max_jobs=4 |
129.tera_tf: | -IPA:max_jobs=4 |
132.zeusmp2: | -IPA:max_jobs=4 |
137.lu: | -IPA:max_jobs=4 |
-IPA:max_jobs=4 |