                           SPEC(R) MPIL2007 Summary
                              Intel Corporation
   Endeavor (Intel Xeon E5-2670, 2.60 GHz, DDR3-1333 MHz, SMT on, Turbo on)
                           Thu Mar 1 20:52:53 2012

MPI2007 License: 13                                      Test date: Mar-2012
Test sponsor: Intel Corporation              Hardware availability: Mar-2012
Tested by:    Pavel Shelepugin               Software availability: Sep-2011

                  Base       Base       Base      Peak       Peak       Peak
Benchmarks       Ranks   Run Time      Ratio     Ranks   Run Time      Ratio
--------------  ------  ---------  ---------    ------  ---------  ---------
121.pop2          2048       79.3       49.1 S
121.pop2          2048       69.4       56.1 *
121.pop2          2048       68.7       56.7 S
122.tachyon       2048        491       3.96 S
122.tachyon       2048       55.6       35.0 S
122.tachyon       2048       56.3       34.5 *
125.RAxML         2048        124       23.5 S
125.RAxML         2048        124       23.6 S
125.RAxML         2048        124       23.5 *
126.lammps        2048       61.0       40.3 S
126.lammps        2048       60.5       40.7 *
126.lammps        2048       60.5       40.7 S
128.GAPgeofem     2048        134       44.4 *
128.GAPgeofem     2048        133       44.6 S
128.GAPgeofem     2048        137       43.4 S
129.tera_tf       2048       46.8       23.5 S
129.tera_tf       2048       47.2       23.3 *
129.tera_tf       2048       47.9       23.0 S
132.zeusmp2       2048       34.3       61.7 *
132.zeusmp2       2048       34.2       62.0 S
132.zeusmp2       2048       35.0       60.6 S
137.lu            2048       33.0        127 S
137.lu            2048       31.7        133 *
137.lu            2048       31.5        134 S
142.dmilc         2048       39.3       93.8 *
142.dmilc         2048       39.2       94.1 S
142.dmilc         2048       39.3       93.7 S
143.dleslie       2048       40.3       76.9 *
143.dleslie       2048       40.2       77.1 S
143.dleslie       2048       40.5       76.5 S
145.lGemsFDTD     2048       77.6       56.8 S
145.lGemsFDTD     2048       75.6       58.3 S
145.lGemsFDTD     2048       77.5       56.9 *
147.l2wrf2        2048        137       60.0 S
147.l2wrf2        2048        147       55.9 *
147.l2wrf2        2048        160       51.2 S
==============================================================================
121.pop2          2048       69.4       56.1 *
122.tachyon       2048       56.3       34.5 *
125.RAxML         2048        124       23.5 *
126.lammps        2048       60.5       40.7 *
128.GAPgeofem     2048        134       44.4 *
129.tera_tf       2048       47.2       23.3 *
132.zeusmp2       2048       34.3       61.7 *
137.lu            2048       31.7        133 *
142.dmilc         2048       39.3       93.8 *
143.dleslie       2048       40.3       76.9 *
145.lGemsFDTD     2048       77.5       56.9 *
147.l2wrf2        2048        147       55.9 *
SPECmpiL_base2007                       51.6
SPECmpiL_peak2007                    Not Run


BENCHMARK DETAILS
-----------------
Type of System:       Homogeneous
Total Compute Nodes:  128
Total Chips:          256
Total Cores:          2048
Total Threads:        4096
Total Memory:         8 TB
Base Ranks Run:       2048
Minimum Peak Ranks:   --
Maximum Peak Ranks:   --
C Compiler:           Intel C++ Composer XE 2011 for Linux,
                      Version 12.0.5.220 Build 20110719
C++ Compiler:         Intel C++ Composer XE 2011 for Linux,
                      Version 12.0.5.220 Build 20110719
Fortran Compiler:     Intel Fortran Composer XE 2011 for Linux,
                      Version 12.0.5.220 Build 20110719
Base Pointers:        64-bit
Peak Pointers:        64-bit
MPI Library:          Intel MPI Library 4.0.3.008 for Linux
Other MPI Info:       None
Pre-processors:       No
Other Software:       None


Node Description: Endeavor Node
===============================

HARDWARE
--------
Number of nodes:      128
Uses of the node:     compute
Vendor:               Intel
Model:                R1208GLBPP
CPU Name:             Intel Xeon E5-2670
CPU(s) orderable:     1-2 chips
Chips enabled:        2
Cores enabled:        16
Cores per chip:       8
Threads per core:     2
CPU Characteristics:  Intel Turbo Boost Technology up to 3.3 GHz,
                      8.0 GT/s QPI, Hyper-Threading enabled
CPU MHz:              2600
Primary Cache:        32 KB I + 32 KB D on chip per core
Secondary Cache:      256 KB I+D on chip per core
L3 Cache:             20 MB I+D on chip per chip, 20 MB shared / 8 cores
Other Cache:          None
Memory:               64 GB (8 x 8 GB 2Rx4 PC3-12800R, ECC, running at
                      1333MHz and CL9)
Disk Subsystem:       Seagate 600 GB SSD ST9600205SS
Other Hardware:       None

Adapter:              Intel (ESB2) 82575EB Dual-Port Gigabit Ethernet
                      Controller
Number of Adapters:   1
Slot Type:            PCI-Express x8
Data Rate:            1Gbps Ethernet
Ports Used:           2
Interconnect Type:    Ethernet

Adapter:              Mellanox MHQH29-XTC
Number of Adapters:   1
Slot Type:            PCIe x8 Gen2
Data Rate:            InfiniBand 4x QDR
Ports Used:           1
Interconnect Type:    InfiniBand

SOFTWARE
--------
Adapter:              Intel (ESB2) 82575EB Dual-Port Gigabit Ethernet
                      Controller
Adapter Driver:       e1000
Adapter Firmware:     None

Adapter:              Mellanox MHQH29-XTC
Adapter Driver:       OFED 1.5.3.1
Adapter Firmware:     2.10.0

Operating System:     Red Hat EL 6.1, kernel 2.6.32-131
Local File System:    Linux/ext2
Shared File System:   NFS
System State:         Multi-User
Other Software:       Platform LSF 8.0


Node Description: NFS
=====================

HARDWARE
--------
Number of nodes:      1
Uses of the node:     fileserver
Vendor:               Intel
Model:                S7000FC4UR
CPU Name:             Intel Xeon CPU
CPU(s) orderable:     1-4 chips
Chips enabled:        4
Cores enabled:        16
Cores per chip:       4
Threads per core:     2
CPU Characteristics:  --
CPU MHz:              2926
Primary Cache:        32 KB I + 32 KB D on chip per core
Secondary Cache:      8 MB I+D on chip per chip, 4 MB shared / 2 cores
L3 Cache:             None
Other Cache:          None
Memory:               64 GB
Disk Subsystem:       8 disks, 500GB/disk, 2.7TB total
Other Hardware:       None

Adapter:              Intel 82563GB Dual-Port Gigabit Ethernet Controller
Number of Adapters:   1
Slot Type:            PCI-Express x8
Data Rate:            1Gbps Ethernet
Ports Used:           1
Interconnect Type:    Ethernet

SOFTWARE
--------
Adapter:              Intel 82563GB Dual-Port Gigabit Ethernet Controller
Adapter Driver:       e1000e
Adapter Firmware:     N/A

Operating System:     RedHat EL 5 Update 4
Local File System:    None
Shared File System:   NFS
System State:         Multi-User
Other Software:       None


Interconnect Description: IB Switch
===================================

HARDWARE
--------
Vendor:               Mellanox
Model:                Mellanox MTS3600Q-1UNC
Switch Model:         Mellanox MTS3600Q-1UNC
Number of Switches:   46
Number of Ports:      36
Data Rate:            InfiniBand 4x QDR
Firmware:             7.2.0
Topology:             Fat tree
Primary Use:          MPI traffic


Interconnect Description: Gigabit Ethernet
==========================================

HARDWARE
--------
Vendor:               Force10 Networks
Model:                Force10 S50, Force10 C300
Switch Model:         Force10 S50, Force10 C300
Number of Switches:   15
Number of Ports:      48
Data Rate:            1Gbps Ethernet, 10Gbps Ethernet
Firmware:             8.2.1.0
Topology:             Fat tree
Primary Use:          Cluster File System


Submit Notes
------------
The config file option 'submit' was used.


General Notes
-------------
 MPI startup command:
   mpiexec.hydra command was used to start MPI jobs.

 BIOS settings:
   Intel Hyper-Threading Technology (SMT): Enabled (default is Enabled)
   Intel Turbo Boost Technology (Turbo)  : Enabled (default is Enabled)

 RAM configuration:
   Compute nodes have 2x8-GB RDIMM on each memory channel.

 Network:
   Forty six 36-port switches: 18 core switches and 28 leaf switches.
   Each leaf has one link to each core. Remaining 18 ports on 25 of 28
   leafs are used for compute nodes. On the remaining 3 leafs the ports
   are used for FS nodes and other peripherals.

 Job placement:
   Each MPI job was assigned to a topologically compact set of nodes,
   i.e. the minimal needed number of leaf switches was used for each job:
   1 switch for 16/32/64/128/256 ranks, 2 switches for 512 ranks,
   4 switches for 1024 ranks, 8 switches for 2048 ranks.

 Platform LSF was used for job submission. It has no impact on performance.
 Information can be found at: http://www.platform.com


Base Compiler Invocation
------------------------
C benchmarks:
     mpiicc

C++ benchmarks:
 126.lammps: mpiicpc

Fortran benchmarks:
     mpiifort

Benchmarks using both Fortran and C:
     mpiicc mpiifort


Base Portability Flags
----------------------
   121.pop2: -DSPEC_MPI_CASE_FLAG
 126.lammps: -DMPICH_IGNORE_CXX_SEEK


Base Optimization Flags
-----------------------
C benchmarks:
     -O3 -xAVX -no-prec-div -ipo

C++ benchmarks:
 126.lammps: -O3 -xAVX -no-prec-div -ipo

Fortran benchmarks:
     -O3 -xAVX -no-prec-div -ipo

Benchmarks using both Fortran and C:
     -O3 -xAVX -no-prec-div -ipo


The flags file that was used to format this result can be browsed at
http://www.spec.org/mpi2007/flags/EM64T_Intel121_flags.html

You can also download the XML flags source by saving the following link:
http://www.spec.org/mpi2007/flags/EM64T_Intel121_flags.xml

SPEC and SPEC MPI are registered trademarks of the Standard Performance
Evaluation Corporation. All other brand and product names appearing in this
result are trademarks or registered trademarks of their respective holders.
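
The leaf-switch counts quoted under "Job placement" can be cross-checked
against the network description with a little arithmetic. The sketch below
is illustrative only and is not part of the submitted result. It assumes one
MPI rank per enabled core (16 per node), which matches the 2048-rank run on
128 nodes but is an assumption for the smaller rank counts, and it takes the
18 compute-node ports per leaf switch from the "Network" note. The function
name leaf_switches_needed is hypothetical.

```python
import math

# Assumption: one MPI rank per enabled core ("Cores enabled: 16" per node).
RANKS_PER_NODE = 16
# From the Network note: 18 ports on each compute leaf go to compute nodes.
NODES_PER_LEAF = 18


def leaf_switches_needed(ranks: int) -> int:
    """Smallest number of leaf switches that can host a job of `ranks` ranks."""
    nodes = math.ceil(ranks / RANKS_PER_NODE)
    return math.ceil(nodes / NODES_PER_LEAF)


if __name__ == "__main__":
    for ranks in (16, 32, 64, 128, 256, 512, 1024, 2048):
        print(f"{ranks:5d} ranks -> {leaf_switches_needed(ranks)} leaf switch(es)")
```

Under these assumptions the computed counts agree with the report: one
switch up to 256 ranks, then 2, 4 and 8 switches for 512, 1024 and 2048
ranks.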
-----------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2006-2010 Standard Performance Evaluation Corporation
Tested with SPEC MPI2007 v2.0.1.
Report generated on Tue Jul 22 13:44:21 2014 by MPI2007 ASCII formatter v1463.
Originally published on 21 March 2012.
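
For readers who want to relate the overall SPECmpiL_base2007 figure to the
per-benchmark rows, the sketch below recomputes it as the geometric mean of
the twelve selected base ratios (the rows marked "*" in the summary table).
It is an illustration added for convenience, not output of the SPEC tools.
It works from the rounded three-digit ratios printed in this report, so the
result may differ from the reported 51.6 in the last digit. The names
median_ratios and geometric_mean are hypothetical.

```python
import math

# Selected (median) base ratios, copied from the rows marked "*" above.
median_ratios = {
    "121.pop2": 56.1,
    "122.tachyon": 34.5,
    "125.RAxML": 23.5,
    "126.lammps": 40.7,
    "128.GAPgeofem": 44.4,
    "129.tera_tf": 23.3,
    "132.zeusmp2": 61.7,
    "137.lu": 133.0,
    "142.dmilc": 93.8,
    "143.dleslie": 76.9,
    "145.lGemsFDTD": 56.9,
    "147.l2wrf2": 55.9,
}


def geometric_mean(values) -> float:
    """Geometric mean computed via the mean of logarithms."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


score = geometric_mean(median_ratios.values())

if __name__ == "__main__":
    print(f"Recomputed base metric from rounded ratios: {score:.2f}")
```

The recomputed value should fall within about 0.1 of the reported 51.6; the
small gap is expected because the report prints rounded ratios.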