| CPU2006 license: | 20 | Test date: | Nov-2008 |
|---|---|---|---|
| Test sponsor: | Bull SAS | Hardware Availability: | Nov-2008 |
| Tested by: | NEC Corporation | Software Availability: | Nov-2008 |
| Hardware | |
|---|---|
| CPU Name: | Intel Xeon X7460 |
| CPU Characteristics: | 1066 MHz system bus |
| CPU MHz: | 2667 |
| FPU: | Integrated |
| CPU(s) enabled: | 24 cores, 4 chips, 6 cores/chip |
| CPU(s) orderable: | 1,2,3,4 chips |
| Primary Cache: | 32 KB I + 32 KB D on chip per core |
| Secondary Cache: | 9 MB I+D on chip per chip, 3 MB shared / 2 cores |
| L3 Cache: | 16 MB I+D on chip per chip |
| Other Cache: | None |
| Memory: | 32 GB (16x2 GB PC2-5300F, 2 rank, CL5-5-5, ECC) |
| Disk Subsystem: | 1x73.2 GB SAS, 15000 RPM |
| Other Hardware: | None |
| Software | |
|---|---|
| Operating System: | SUSE Linux Enterprise Server 10 (x86_64) SP2, Kernel 2.6.16.60-0.21-smp |
| Compiler: | Intel C++ and Fortran Compiler 11.0 for Linux Build 20080930 Package ID: l_cproc_p_11.0.069, l_cprof_p_11.0.069 |
| Auto Parallel: | Yes |
| File System: | ext2 |
| System State: | Run level 3 (multi-user) |
| Base Pointers: | 64-bit |
| Peak Pointers: | 32/64-bit |
| Other Software: | Binutils 2.18.50.0.7.20080502 |

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

| Benchmark | Base Copies | Base Seconds (run 1) | Base Ratio (run 1) | Base Seconds (run 2) | Base Ratio (run 2) | Base Seconds (run 3) | Base Ratio (run 3) | Peak Copies | Peak Seconds (run 1) | Peak Ratio (run 1) | Peak Seconds (run 2) | Peak Ratio (run 2) | Peak Seconds (run 3) | Peak Ratio (run 3) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 410.bwaves | 24 | 5203 | 62.7 | 5194 | 62.8 | 5194 | 62.8 | 24 | 5192 | 62.8 | 5194 | 62.8 | 5193 | 62.8 |
| 416.gamess | 24 | 987 | 476 | 989 | 475 | 989 | 475 | 24 | 967 | 486 | 968 | 486 | 966 | 486 |
| 433.milc | 24 | 3189 | 69.1 | 3174 | 69.4 | 3175 | 69.4 | 24 | 3162 | 69.7 | 3164 | 69.6 | 3164 | 69.6 |
| 434.zeusmp | 24 | 1597 | 137 | 1576 | 139 | 1580 | 138 | 24 | 1532 | 143 | 1530 | 143 | 1537 | 142 |
| 435.gromacs | 24 | 465 | 369 | 467 | 367 | 464 | 370 | 24 | 461 | 372 | 461 | 372 | 460 | 372 |
| 436.cactusADM | 24 | 1921 | 149 | 1923 | 149 | 1923 | 149 | 1 | 65.6 | 182 | 65.7 | 182 | 65.6 | 182 |
| 437.leslie3d | 24 | 4476 | 50.4 | 4180 | 54.0 | 4153 | 54.3 | 24 | 4072 | 55.4 | 4073 | 55.4 | 4071 | 55.4 |
| 444.namd | 24 | 564 | 341 | 562 | 342 | 563 | 342 | 24 | 566 | 340 | 568 | 339 | 566 | 340 |
| 447.dealII | 24 | 1103 | 249 | 1094 | 251 | 1102 | 249 | 24 | 1052 | 261 | 1056 | 260 | 1060 | 259 |
| 450.soplex | 24 | 2983 | 67.1 | 2904 | 68.9 | 2906 | 68.9 | 24 | 2708 | 73.9 | 2635 | 76.0 | 2632 | 76.1 |
| 453.povray | 24 | 246 | 519 | 245 | 521 | 243 | 526 | 24 | 200 | 639 | 199 | 640 | 199 | 640 |
| 454.calculix | 24 | 539 | 367 | 542 | 366 | 547 | 362 | 24 | 537 | 369 | 540 | 367 | 537 | 369 |
| 459.GemsFDTD | 24 | 5411 | 47.1 | 5406 | 47.1 | 5398 | 47.2 | 24 | 5372 | 47.4 | 5373 | 47.4 | 5378 | 47.4 |
| 465.tonto | 24 | 1211 | 195 | 1198 | 197 | 1219 | 194 | 24 | 1169 | 202 | 1167 | 202 | 1171 | 202 |
| 470.lbm | 24 | 9208 | 35.8 | 9178 | 35.9 | 9184 | 35.9 | 12 | 3226 | 51.1 | 3221 | 51.2 | 3219 | 51.2 |
| 481.wrf | 24 | 2899 | 92.5 | 2899 | 92.5 | 2903 | 92.3 | 24 | 2899 | 92.5 | 2899 | 92.5 | 2903 | 92.3 |
| 482.sphinx3 | 24 | 4154 | 113 | 4170 | 112 | 4153 | 113 | 12 | 1235 | 189 | 1243 | 188 | 1237 | 189 |
The config file option 'submit' was used. taskset was used to bind processes to cores, except for 436.cactusADM peak. For peak runs using half the number of available cores (12 copies), each copy was assigned to a single L2 cache using the mysubmit.pl script. See the flags description file for mysubmit.pl details.
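
The binding scheme described in this note can be sketched as follows. This is only an illustration, not the contents of mysubmit.pl, which this report does not show. It assumes that the two cores sharing each 3 MB L2 cache are numbered consecutively (0-1, 2-3, and so on); the real numbering depends on how the kernel enumerates the cores. The function names are hypothetical.

```python
def core_for_copy(copy_index, copies, total_cores=24, cores_per_l2=2):
    """Pick the core that a given benchmark copy is pinned to.

    One copy per core: copy i runs on core i.
    One copy per L2 cache: copy i runs on the first core of the i-th L2 pair.
    ASSUMPTION: cores sharing an L2 cache have consecutive numbers.
    """
    if copies == total_cores:
        return copy_index
    if copies * cores_per_l2 == total_cores:
        return copy_index * cores_per_l2
    raise ValueError("copy count not covered by this sketch")


def submit_command(copy_index, copies, command):
    """Build the taskset-prefixed command line for one copy."""
    return f"taskset -c {core_for_copy(copy_index, copies)} {command}"


print(submit_command(3, 24, "./benchmark"))  # taskset -c 3 ./benchmark
print(submit_command(3, 12, "./benchmark"))  # taskset -c 6 ./benchmark
```

With 12 copies, each copy gets a whole L2 cache to itself, which is a plausible reason for the higher peak ratios of 470.lbm and 482.sphinx3 despite running fewer copies.
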
'ulimit -s unlimited' was used to set the stack size to unlimited prior to the run.
OMP_NUM_THREADS was set to the number of cores.
KMP_AFFINITY was set to "physical,0".
KMP_STACKSIZE was set to 64M.
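
As a rough illustration of these settings, the sketch below builds the environment and the shell prefix for launching a process. The helper names are hypothetical, and the report does not say how the tester actually applied the settings. The thread-related variables presumably matter mainly for the auto-parallelized 436.cactusADM peak build.

```python
import os


def run_environment(num_cores, base_env=None):
    """Return a copy of the environment with the settings from the run notes."""
    env = dict(os.environ if base_env is None else base_env)
    env.update({
        "OMP_NUM_THREADS": str(num_cores),  # one thread per core
        "KMP_AFFINITY": "physical,0",
        "KMP_STACKSIZE": "64M",
    })
    return env


def with_unlimited_stack(command):
    """Prefix a shell command so the stack limit is raised in the same shell."""
    return f"ulimit -s unlimited && {command}"


env = run_environment(24, base_env={})
print(env["OMP_NUM_THREADS"])                 # 24
print(with_unlimited_stack("./benchmark"))    # ulimit -s unlimited && ./benchmark
```

A launcher could pass these to `subprocess.run(with_unlimited_stack(cmd), shell=True, env=run_environment(24))`; raising the limit succeeds only if the hard stack limit permits it.
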
BIOS settings:
Hardware Prefetcher: Disabled
Adjacent Cache Line Prefetch: Disabled
FSB High Bandwidth Optimization: Enabled
The NEC Express5800/R140a-4 (Intel Xeon X7460) and the Bull NovaScale R480 E1 (Intel Xeon X7460, 2.66 GHz) models are electronically equivalent. The results were measured on an NEC Express5800/R140a-4 (Intel Xeon X7460) model.
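| Base Compiler Invocation | |
|---|---|
| C benchmarks: | icc |
| C++ benchmarks: | icpc |
| Fortran benchmarks: | ifort |
| Benchmarks using both Fortran and C: | icc ifort |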
| Base Portability Flags | |
|---|---|
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
| 436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
| 437.leslie3d: | -DSPEC_CPU_LP64 |
| 444.namd: | -DSPEC_CPU_LP64 |
| 447.dealII: | -DSPEC_CPU_LP64 |
| 450.soplex: | -DSPEC_CPU_LP64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| 482.sphinx3: | -DSPEC_CPU_LP64 |
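| Base Optimization Flags | |
|---|---|
| C benchmarks: | -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |
| C++ benchmarks: | -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |
| Fortran benchmarks: | -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |
| Benchmarks using both Fortran and C: | -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |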
| Peak Compiler Invocation | |
|---|---|
| C benchmarks (except as noted below): | icc |
| 482.sphinx3: | /opt/intel/Compiler/11.0/069/bin/ia32/icc -L/opt/intel/Compiler/11.0/069/ipp/ia32/lib -I/opt/intel/Compiler/11.0/069/ipp/ia32/include |
| C++ benchmarks (except as noted below): | icpc |
| 450.soplex: | /opt/intel/Compiler/11.0/069/bin/ia32/icpc -L/opt/intel/Compiler/11.0/069/ipp/ia32/lib -I/opt/intel/Compiler/11.0/069/ipp/ia32/include |
| Fortran benchmarks (except as noted below): | ifort |
| 437.leslie3d: | /opt/intel/Compiler/11.0/069/bin/ia32/ifort -L/opt/intel/Compiler/11.0/069/ipp/ia32/lib -I/opt/intel/Compiler/11.0/069/ipp/ia32/include |
| Benchmarks using both Fortran and C: | icc ifort |
| Peak Portability Flags | |
|---|---|
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
| 436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
| 444.namd: | -DSPEC_CPU_LP64 |
| 447.dealII: | -DSPEC_CPU_LP64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| Peak Optimization Flags | |
|---|---|
| C benchmarks: | |
| 433.milc: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -fno-alias |
| 470.lbm: | -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch -auto-ilp32 |
| 482.sphinx3: | -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 |
| C++ benchmarks: | |
| 444.namd: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -fno-alias -auto-ilp32 |
| 447.dealII: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 -ansi-alias -scalar-rep- |
| 450.soplex: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-malloc-options=3 |
| 453.povray: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll4 -ansi-alias |
| Fortran benchmarks: | |
| 410.bwaves: | -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |
| 416.gamess: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 -Ob0 -ansi-alias -scalar-rep- |
| 434.zeusmp: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static |
| 437.leslie3d: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-malloc-options=3 -opt-prefetch |
| 459.GemsFDTD: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 -Ob0 -opt-prefetch |
| 465.tonto: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll4 -auto |
| Benchmarks using both Fortran and C: | |
| 435.gromacs: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch -auto-ilp32 |
| 436.cactusADM: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 -opt-prefetch -parallel -auto-ilp32 |
| 454.calculix: | -xSSE4.1 -ipo -O3 -no-prec-div -static -auto-ilp32 |
| 481.wrf: | basepeak = yes |
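
Many of the peak entries list `-prof-gen(pass 1) -prof-use(pass 2)`, meaning a two-pass, profile-guided build: the first pass produces an instrumented binary, a training run collects profile data, and the second pass recompiles using that data. The sketch below builds the two command lines for such a build. It is a simplified illustration, not how the SPEC tools drive the compiler; the function name, source file, and output name are hypothetical, and the flags are the ones listed for 433.milc.

```python
def feedback_build_passes(compiler, flags, sources, output):
    """Return the (pass 1, pass 2) command lines for a profile-guided build.

    Pass 1 adds -prof-gen to produce an instrumented binary; pass 2 adds
    -prof-use to recompile with the profile gathered by a training run.
    All other flags are identical in both passes.
    """
    common = [*flags, *sources, "-o", output]
    pass1 = [compiler, "-prof-gen", *common]
    pass2 = [compiler, "-prof-use", *common]
    return pass1, pass2


# Peak flags listed for 433.milc, without the per-pass profiling flags.
MILC_FLAGS = ["-xSSE4.1", "-ipo", "-O3", "-no-prec-div", "-static", "-fno-alias"]

# "example.c" and "example" are placeholder names for illustration only.
pass1, pass2 = feedback_build_passes("icc", MILC_FLAGS, ["example.c"], "example")
print(" ".join(pass1))
print(" ".join(pass2))
```

Entries without the pass annotations, such as 470.lbm and 454.calculix, are built in a single pass with the listed flags.
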